AD-785 199


EXTENDED  ARRAY  EVALUATION  PROGRAM. 
SPECIAL  REPORT  NO.  2.  SIMULATED  ON-LINE 
ADAPTIVE  PROCESSING  RESULTS  USING  ALASKA 
LONG  PERIOD  ARRAY  DATA 

Thomas E. Barnard

Texas  Instruments,  Incorporated 


Prepared for:

Air Force Technical Applications Center
Advanced Research Projects Agency


23 October 1973


DISTRIBUTED  BY: 


National  Technical  Information  Service 
U.  S.  DEPARTMENT  OF  COMMERCE 

5285  Port  Royal  Road,  Springfield  Va.  22151 


APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED 


SIMULATED  ON-LINE  ADAPTIVE  PROCESSING  RESULTS  USING  ALASKA 

LONG  PERIOD  ARRAY  DATA 


SPECIAL  REPORT  NO.  2 

EXTENDED  ARRAY  EVALUATION  PROGRAM 

Prepared  by 
Thomas  E.  Barnard 

TEXAS  INSTRUMENTS  INCORPORATED 
Equipment  Group 
Post  Office  Box  6015 
Dallas,  Texas  75222 


Prepared  for 

AIR  FORCE  TECHNICAL  APPLICATIONS  CENTER 
AFTAC  Project  No.  VELA  T/2705/B/ASD 
Alexandria,  Virginia  22314 


Sponsored  by 

ADVANCED  RESEARCH  PROJECTS  AGENCY 
Nuclear  Monitoring  Research  Office 
ARPA  Program  Code  No.  2F10 
ARPA  Order  No.  1714 


23  October  1973 


Acknowledgment: This research was supported by the Advanced
Research Projects Agency, Nuclear Monitoring Research Office,
under Project VELA-UNIFORM, and accomplished under the technical
direction of the Air Force Technical Applications Center
under Contract No. F33657-72-C-0725.




ACKNOWLEDGMENTS 


Several people have contributed in various ways to the generation
of this report.


Virtually all of the text was typed by Mrs. Cherylann Saunders.
Her patience with numerous changes in complicated mathematical equations
is sincerely appreciated.

Mr. Charles O. Kemper wrote the computer programs which
created Figures III-18 through III-21, III-25, and III-26.

Terence W. [surname illegible], Captain John W. Woods, and Lieutenant
[name partly illegible] J. Marcus provided encouragement and technical
discussions which improved the quality of the report.

Dr. Carl F. Romney asked some pertinent questions which ultimately
led to a technique for measuring signal degradation by the adaptive
beamformer.


Thomas E. Barnard


ABSTRACT 


This report deals with results obtained from operating an adaptive
time-domain maximum-likelihood filtering system on data from the Alaska
Long-Period Array (ALPA). Signal-to-noise gain of adaptive filtering
relative to beamsteering is investigated as a function of convergence rate
and steer direction. In addition, the effect upon signal-to-noise gain of
freezing the adaptive filter set is described. Both on-azimuth and
off-azimuth signals are examined to determine how much they are attenuated
in the adaptive-filter beam in comparison with the beamsteer output.
Signal-to-noise gain values presented are measured using beam output traces
formed from actual seismic data. Theoretical studies of the effect of
floating DC levels and roundoff error are also contained in this report.

A  major  advantage  of  adaptive  multichannel  filtering  is  found  to  be  the 
capability  to  narrow  the  main  lobe  of  the  array  beam  pattern. 


Neither the Advanced Research Projects Agency nor the Air Force
Technical Applications Center will be responsible for information contained
herein which has been supplied by other organizations or contractors, and
this document is subject to later revision as may be necessary. The views
and conclusions presented are those of the authors and should not be
interpreted as necessarily representing the official policies, either
expressed or implied, of the Advanced Research Projects Agency, the Air
Force Technical Applications Center, or the US Government.




TABLE OF CONTENTS

SECTION   TITLE

          ACKNOWLEDGMENTS
          ABSTRACT

I.        INTRODUCTION
          A.  PURPOSE OF THIS STUDY
          B.  DESCRIPTION OF ALASKA LONG-PERIOD ARRAY (ALPA)
          C.  ADAPTIVE MULTICHANNEL FILTERING
          D.  DESCRIPTION OF THE MAXIMUM-LIKELIHOOD ADAPTIVE FILTER
              ALGORITHM
          E.  GEOMETRICAL INTERPRETATION OF THE ABF ALGORITHM
          F.  A SIMPLE ILLUSTRATIVE EXAMPLE
          G.  ORGANIZATION OF REPORT

II.       THE PROBLEM OF FLOATING MEANS
          A.  FLOATING MEANS IN ALPA DATA
          B.  EFFECT OF DC LEVELS ON ADAPTIVE FILTERING IMPROVEMENT
          C.  TECHNIQUES USED TO ELIMINATE DC BIAS FROM THE DATA

III.      EFFECT OF ROUNDOFF ERROR IN THE FILTER UPDATE EQUATION
          A.  DISCUSSION
          B.  ERRORS ASSOCIATED WITH A SINGLE COMPONENT OF THE VECTOR
              $(X-\bar{X})$
          C.  DIRECTIONAL ERROR OF THE VECTOR $(X-\bar{X})$
          D.  ERROR IN THE FILTER OUTPUT
          E.  DIRECTIONAL ERROR OF THE FILTER UPDATE VECTOR
              $(A^{new} - A^{old})$
          F.  SUMMARY

IV.       SIGNAL-TO-NOISE RATIO IMPROVEMENT IN THE PRESENCE OF
          BACKGROUND NOISE
          A.  INTRODUCTION
          B.  NOISE REDUCTION AS A FUNCTION OF CONVERGENCE RATE
          C.  NOISE REDUCTION AS A FUNCTION OF FREQUENCY
          D.  SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE
          E.  SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE

V.        EFFECT OF FILTER FREEZE ON SIGNAL-TO-NOISE RATIO IMPROVEMENT
          A.  GENERAL DISCUSSION
          B.  EFFECT OF FILTER FREEZE ON SIGNALS
          C.  EFFECT OF FILTER FREEZE ON NOISE REDUCTION
          D.  AN ALTERNATE ADAPTIVE ALGORITHM

VI.       VARIABILITY OF NOISE REDUCTION
          A.  INTRODUCTION
          B.  VARIATION IN NOISE REDUCTION AT DIFFERENT STEER DIRECTIONS
          C.  VARIABILITY OF SIGNAL-TO-NOISE GAIN

VII.      CONCLUSIONS

VIII.     REFERENCES


LIST OF FIGURES

FIGURE    TITLE

I-1       ALPA GEOMETRY
I-2       SCHEMATIC DIAGRAM OF MULTICHANNEL FILTERING
I-3       A GEOMETRICAL INTERPRETATION OF TIME-DOMAIN MAXIMUM-LIKELIHOOD ADAPTIVE FILTERING
I-4       A SIMULATED DISPLAY OF THE TIME SERIES INPUT CORRESPONDING TO THE ILLUSTRATIVE EXAMPLE
II-1      PRE-FILTER RESPONSE (dB)
III-1     ERROR PROBABILITY DENSITY PER COUNT AFTER FORMATION OF VERTICAL COMPONENT
III-2     ERROR PROBABILITY DENSITY PER COUNT AFTER PREFILTERING VERTICAL COMPONENT
III-3     BEAMSTEER ERROR PROBABILITY DENSITY PER COUNT BEFORE ROUNDOFF
III-4     PROBABILITY DENSITY PER COUNT FOR BEAMSTEER ROUNDOFF ERROR
III-5     BEAMSTEER ERROR PROBABILITY DENSITY PER COUNT AFTER ROUNDOFF
III-6     ERROR PROBABILITY DENSITY PER COUNT IN THE TERM $\bar{x}(t) - x_i(t)$ (BEAMSTEER OUTPUT MINUS A SINGLE CHANNEL) BEFORE BEAMSTEER ROUNDOFF
III-7     ERROR PROBABILITY DENSITY PER COUNT IN THE TERM $\bar{x}(t) - x_i(t)$ (BEAMSTEER OUTPUT MINUS A SINGLE CHANNEL) AFTER BEAMSTEER ROUNDOFF


III-8     PROBABILITY DENSITY PER COUNT SQUARED FOR THE RANDOM VARIABLE $|E_c|^2$ (SQUARED MAGNITUDE OF ERROR VECTOR PROJECTED ONTO CONSTRAINT SPACE)
III-9     PROBABILITY MASS FUNCTION FOR THE RANDOM VARIABLE $|E_b|^2$
III-10    CUMULATIVE DISTRIBUTION FUNCTION AND HISTOGRAM FOR THE RANDOM VARIABLE $|X-\bar{X}|$ (FOR A NOISE SAMPLE COVERING THE INTERVAL 0757-1150 ON DAY 238 OF 1970)
III-11    CUMULATIVE DISTRIBUTION FUNCTION AND PROBABILITY DENSITY PER DEGREE FOR THE ANGLE $\alpha$ BETWEEN $(X-\bar{X})$ AND $E_c$ IN A 155-DIMENSIONAL CONSTRAINT SPACE
III-12    TWO-DIMENSIONAL PLANE PASSING THROUGH THE ORIGIN, $(X-\bar{X})$, AND $E_c$
III-13    CUMULATIVE DISTRIBUTION FUNCTION $P(\theta < \theta_0)$ OVER A RANGE OF THE SQUARED MAGNITUDE $|X-\bar{X}|^2$
III-14    PROBABILITY DENSITY PER DEGREE FOR THE ANGLE BETWEEN THE VECTOR $(X-\bar{X})$ AND THE PROJECTION OF THE VECTOR $(X-\bar{X}) + E_c + E_b$ ONTO THE CONSTRAINT SPACE
III-15    TWO-DIMENSIONAL PLANE PASSING THROUGH THE ORIGIN, $(X-\bar{X}) + E_c + E_b$, AND $(X-\bar{X})$
III-16    REGION IN THE CUTTING PLANE WHERE [condition illegible] IN TERMS OF THE NORMALIZED CUTTING-PLANE COORDINATES $C = |E_c|/|X-\bar{X}|$ AND $B = |E_b|/|X-\bar{X}|$
III-17    PROBABILITY DENSITY PER DEGREE FOR THE ANGLE $\phi$ BETWEEN THE VECTOR $(X-\bar{X})$ AND THE VECTOR $(X-\bar{X}) + E_c + E_b$


III-18    ADAPTIVE FILTER OUTPUT ERROR PROBABILITY DENSITY PER COUNT BEFORE ROUNDOFF (MAXIMUM POSSIBLE VARIANCE)
III-19    PROBABILITY THAT y(t) CHANGES SIGN OR ROUNDS TO ZERO GIVEN |y(t)| [MOST FAVORABLE CASE: $y(t) = \bar{x}(t)$]
III-20    PROBABILITY THAT y(t) CHANGES SIGN OR ROUNDS TO ZERO GIVEN |y(t)| [MOST UNFAVORABLE CASE: $|a_i(j)| = 1/2$]
III-21    REGIONS IN THE $[y(t), e_y(t)]$ PLANE CORRESPONDING TO SIGN REVERSAL AND ROUNDING TO ZERO
III-22    TWO-DIMENSIONAL PLANE PASSING THROUGH THE ORIGIN, D, AND $K(X-\bar{X} + E_b + E_c)$
III-23    REGIONS IN THE $(z_1, z_2)$-PLANE SATISFYING THE INEQUALITIES $L_j - 1/2 < g_1(j) + g_2(j) < L_j + 1/2$
III-24    PROBABILITY MASS FUNCTION FOR THE RANDOM VARIABLE $Y = L_j - KMe_b(t-j)$ AS A FUNCTION OF K
III-25    PROBABILITY DENSITY FUNCTION FOR THE RANDOM VARIABLE $Y = KMe_b(t-j) + \sum_{i=1}^{M-1} g_i(j)$ AS A FUNCTION OF K
III-26    PROBABILITY MASS FUNCTION FOR THE RANDOM VARIABLE $Y = L_j$ AS A FUNCTION OF K
III-27    ESTIMATED MEAN $\bar{\theta}$ OF THE ANGLE BETWEEN $(X-\bar{X})$ AND sgn[y(t)] sgn[y'(t)] $(X-\bar{X} + E_c + D/K)$ AS A FUNCTION OF CONVERGENCE RATE (USING NOISE DATA FROM DAY 238 OF 1970)


III-28    ESTIMATED MEAN $\bar{\theta}$ OF THE ANGLE BETWEEN $(X-\bar{X})$ AND sgn[y(t)] sgn[y'(t)] $(X-\bar{X} + E_c + D/K + C/K)$ AS A FUNCTION OF CONVERGENCE RATE (USING NOISE DATA FROM DAY 238 OF 1970)
III-29    SET OF POSSIBLE INTEGER CONSTRAINT-CONDITION DISCREPANCIES AND TRANSITIONS BETWEEN DISCREPANCIES FOR THE FIVE-CHANNEL CASE
III-30    SET OF POSSIBLE INTEGER CONSTRAINT-CONDITION DISCREPANCIES AND TRANSITIONS BETWEEN DISCREPANCIES FOR THE SIX-CHANNEL CASE
IV-1      NOISE REDUCTION AS A FUNCTION OF CONVERGENCE RATE, DAY 238 OF 1970
IV-2      NOISE REDUCTION AS A FUNCTION OF CONVERGENCE RATE, DAY 203 OF 1971
IV-3      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 238 1970, STEER DIRECTION 270°, Ks = 0.005)
IV-4      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 238 1970, STEER DIRECTION 270°, Ks = 0.005)
IV-5      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 203 1971, STEER DIRECTION 270°, Ks = 0.005)
IV-6      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 203 1971, STEER DIRECTION 270°, Ks = 0.005)
IV-7      SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE FOR A WEAK SIGNAL APPROXIMATELY 6 dB ABOVE NOISE LEVEL ON BEAMSTEER OUTPUT (STEER AZIMUTH 302.5°)

IV-9      SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE FOR A SIGNAL 18 dB ABOVE NOISE LEVEL ON BEAMSTEER OUTPUT
IV-10     SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE FOR A SIGNAL 24 dB ABOVE NOISE LEVEL ON BEAMSTEER OUTPUT
IV-11     MAGNITUDE 4.9 EVENT FROM KAMCHATKA (FILTER ADAPTING, STEER DIRECTION 273°, Ks = 0.005)
IV-12     SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE FOR A WEAK SIGNAL APPROXIMATELY 6 dB ABOVE NOISE LEVEL ON BEAMSTEER OUTPUT (STEER AZIMUTH 302.5°)
IV-13     WEAK EVENT FROM 300°-305° (FILTER ADAPTING, STEER DIRECTION 302.5°, Ks = 0.25)
IV-14     SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE FOR TWO STRONG KAMCHATKA SIGNALS (USING NOISE REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)
IV-15     SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE FOR TWO STRONG KAMCHATKA SIGNALS (USING NOISE REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)
V-1       MAIN RAYLEIGH WAVE ARRIVAL FROM 5.9 NEW GUINEA EVENT (JANUARY 7, 1972)


V-2       EVENT PROBABLY FROM SOUTH OF PANAMA (LASA $m_b$ = 3.6)
V-3       MAGNITUDE 4.5 EVENT FROM KAMCHATKA (FILTER FROZEN, STEER DIRECTION 273°, Ks = 0.005)
V-4       MAGNITUDE 4.7 EVENT FROM SOUTH OF PANAMA
V-5       WEAK EVENT FROM 300°-305° (FILTER ADAPTING, STEER DIRECTION 273°, Ks = 0.005, FILTER FROZEN DURING EARLIER KAMCHATKA EVENT)
V-6       WEAK EVENT FROM 300°-305° (FILTER ADAPTING, STEER DIRECTION 273°, Ks = 0.005, FILTER ADAPTING DURING EARLIER KAMCHATKA EVENT)
V-7       MAGNITUDE 4.9 EVENT FROM KAMCHATKA (FILTER FROZEN, STEER DIRECTION 273°, Ks = 0.005)
V-8       MAGNITUDE 4.3 EVENT FROM ANDREANOF ISLANDS
V-9       MEASURED LOSS IN NOISE REDUCTION AS A FUNCTION OF TIME ELAPSED SINCE FILTER FREEZE (USING NOISE FROM DAY 232 OF 1970)
V-10      APPARENT TREND OF NOISE REDUCTION LOSS AS A FUNCTION OF TIME ELAPSED SINCE FILTER FREEZE (USING NOISE FROM DAY 232 OF 1970)
V-11      TREND OF SIGNAL-TO-NOISE GAIN (WITH FILTER FROZEN) FOR KAMCHATKA EVENT 18 dB ABOVE BEAMSTEER NOISE LEVEL (USING NOISE FROM DAY 232 OF 1970)


VI-1      TWO MICROSEISMIC BURSTS WHICH TRIGGERED FILTER FREEZE (DAY 321 1971, STEER DIRECTION 90°, Ks = 0.005)
VI-2      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 321 1971, STEER DIRECTION 0°, Ks = 0.005)
VI-3      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 321 1971, STEER DIRECTION 0°, Ks = 0.005)
VI-4      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 321 1971, STEER DIRECTION 90°, Ks = 0.005)
VI-5      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 321 1971, STEER DIRECTION 90°, Ks = 0.005)
VI-6      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 321 1971, STEER DIRECTION 180°, Ks = 0.005)
VI-7      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 321 1971, STEER DIRECTION 180°, Ks = 0.005)
VI-8      BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY (DAY 321 1971, STEER DIRECTION 270°, Ks = 0.005)
VI-9      ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY (DAY 321 1971, STEER DIRECTION 270°, Ks = 0.005)
VI-10     WEAK EVENT WHICH TRIGGERED FILTER FREEZE (DAY 203 1971, STEER DIRECTION 270°, Ks = 0.005)
VI-11     NEW IRELAND EVENT WITH GLITCH (ORIGIN TIME 04:37:09.1) (DAY 203 1971, STEER DIRECTION 180°, Ks = 0.005)


VI-12 through VI-22 (several titles in this part of the list are only partly legible; the recoverable text is given in source order)

          NEW IRELAND EVENT WITH [illegible] (ORIGIN TIME 04:37:09.1) ([illegible], STEER DIRECTION 180°, Ks = 0.005)
          QUIETEST NOISE SECTION BETWEEN [illegible]
          NORTHERN CELEBES EVENT ([illegible], STEER DIRECTION 270°, Ks = 0.005)
          [illegible] (STEER DIRECTION 0°, Ks = 0.005)
          [ADAPTIVE] FILTER NOISE REDUCTION AS A [illegible]
          [illegible] (STEER DIRECTION 90°, Ks = 0.005)
          ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY ([illegible] 1971, STEER DIRECTION 90°, Ks = 0.005)
          BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY ([illegible], STEER DIRECTION 180°, Ks = 0.005)
          ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY ([illegible] 1971, STEER DIRECTION 180°, Ks = 0.005)
          NOISE REDUCTION AND SIGNAL-TO-NOISE GAIN (6 dB S/N RATIO) FOR EACH OF THE TEN FOUR-HOUR NOISE SAMPLES PROCESSED


LIST OF TABLES

TABLE     TITLE

I-1       ALPA SITE LOCATIONS
II-1      PRE-FILTER WEIGHTS
III-1     STANDARD DEVIATION OF ERROR AND MAXIMUM ERROR AT EACH STAGE OF PROCESSING FOR UNSCALED DATA, DATA SCALED BY 16, AND FLOATING-POINT DATA
III-2     VARIANCE AND RATIO OF VARIANCE TO SQUARED MEAN FOR THE RANDOM VARIABLE $\epsilon_i^2(t-j)$ (SQUARED PREFILTERED VERTICAL-COMPONENT OUTPUT ERROR)
III-3     MEAN, STANDARD DEVIATION, AND VARIANCE OF $|E_c|^2$ (SQUARED MAGNITUDE OF ERROR VECTOR PROJECTED ONTO CONSTRAINT SPACE)
III-4     MAXIMUM PROBABILITY OF SIGN INVERSION OR ROUNDING TO ZERO IN ADAPTIVE FILTER OUTPUT (DAY 238, STEER DIRECTION 270°, Ks = 0.005)
III-5     NOISE REDUCTION AS A FUNCTION OF DATA SCALE FACTOR (DAY 7 1972, 0430 TO 0648)
III-6     PROBABILITY OF NO MOVEMENT AND STANDARD DEVIATION OF THE CHANGE IN A FILTER-WEIGHT COMPONENT WHOSE RMS CHANGE OVER TIME IS THE SAME AS THE RMS FILTER-WEIGHT CHANGE ACROSS COMPONENTS
III-7     MAXIMUM POSSIBLE ANGLE OF DEFLECTION DUE TO ROUNDING THE FILTER UPDATE VECTOR AT THE CONVERGENCE RATE Ks = 0.005 (USING NOISE DATA FROM DAY 238 OF 1970)


III-8     ESTIMATED MEAN $\bar{\theta}$ OF THE ANGLE BETWEEN $(X-\bar{X})$ AND sgn[y(t)] sgn[y'(t)] $(X-\bar{X} + E_c + D/K)$ AS A FUNCTION OF THE CONVERGENCE FACTOR Ks (USING NOISE DATA FROM DAY 238 OF 1970)
III-9     ESTIMATED MEAN $\bar{\theta}$ OF THE ANGLE BETWEEN $(X-\bar{X})$ AND sgn[y(t)] sgn[y'(t)] $(X-\bar{X} + E_c + D/K + C/K)$ AS A FUNCTION OF THE CONVERGENCE FACTOR Ks (USING NOISE DATA FROM DAY 238 OF 1970)
IV-1      ADAPTIVE FILTERING NOISE REDUCTION VERSUS CONVERGENCE RATE (DAY 238 1970)
IV-2      ADAPTIVE FILTERING NOISE REDUCTION VERSUS CONVERGENCE RATE (DAY 203 1971)
IV-3      ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE RATE FOR A WEAK SIGNAL FROM 300°-305° (DAY 276 1971)
IV-4      ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE RATE FOR A STRONG SIGNAL FROM KAMCHATKA (DAY 276 1971)
IV-5      ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (DAY 276 1971)
IV-6      ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A WEAK SIGNAL FROM 300°-305° (USING NOISE REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)


IV-7      ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A WEAK SIGNAL FROM 300°-305° (USING NOISE REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)
IV-8      ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A STRONG SIGNAL FROM KAMCHATKA (USING NOISE REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)
IV-9      ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (USING NOISE REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)
IV-10     ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A STRONG SIGNAL FROM KAMCHATKA (USING NOISE REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)
IV-11     ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (USING NOISE REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)
V-1       PDE EVENTS ARRIVING AT ALPA BETWEEN 2000 AND 2357 ON OCTOBER 3, 1971
V-2       BROADBAND NOISE REDUCTION AS A FUNCTION OF TIME FOR AN ADAPTING FILTER SET AND A FROZEN FILTER SET (USING DATA FROM DAY 232 OF 1970)
VI-1      EVENTS ARRIVING AT ALPA BETWEEN 0355 AND 0725 ON JULY 22, 1971
VI-2      NOISE REDUCTION FOR TEN FOUR-HOUR SAMPLES


SECTION  I 
INTRODUCTION 


A.  PURPOSE  OF  THIS  STUDY 

The adaptive processing task of the Extended Array Evaluation Program
has as its objectives:

•  To gain experience in operating a real-time adaptive signal
estimation processor based on the time-domain maximum-likelihood
algorithm

•  To perform theoretical studies relating to the convergence of
the algorithm and to analyze the output of the adaptive processor
in an attempt to upgrade its performance

This report deals solely with the problems associated with operating the
real-time adaptive processor on ALPA data. A modified version of the TI
interim ALPA system was used to implement the adaptive-filtering algorithm.
Theoretical studies of the effect of floating means and roundoff error upon
filter performance are contained in this report.


B.  DESCRIPTION OF ALASKA LONG-PERIOD ARRAY (ALPA)


The ALPA array is shown in Figure I-1. ALPA is a 19-element hexagonal
array with 20 km spacing between sites. Table I-1 gives the ALPA
site locations. In this report, sites are referred to in terms of their
transmission order. Thus, site 10 refers to site 3-45 in the official
nomenclature.


[FIGURE I-1. ALPA GEOMETRY: map of the array sites labeled by transmission
order and official site designation, with a 10 km scale marker]


C.  ADAPTIVE  MULTICHANNEL  FILTERING 

Multichannel filtering is a form of array processing in which multiple
channel inputs undergo individual frequency-shaping and phase-shift filtering
prior to the channel-summation operation which produces the beamformer
output. Figure I-2 is a schematic diagram of multichannel filtering. This
illustration incorporates the option to preprocess the transducer outputs
before they are input to the multichannel beamformer. Examples of
preprocessing are frequency filtering (most commonly with identical
frequency responses on all channels) and time shifting to align waves
emanating from a particular direction. The preprocessed transducer outputs
become the input channels to a multichannel filter set, where individual
filters (generally different from channel to channel) are applied to the
input channels. These filters are implemented as convolution filters in
time-domain processing or as complex-valued multiplicative filters in
frequency-domain processing. The multichannel filter output is created by
summing the individual filtered channel outputs.

In systems where second-order statistics (crosscorrelation functions
and crosspower spectra) are used to describe interrelationships among the
input channels, there are two basic forms of multichannel filtering. In
Wiener-Kolmogorov multichannel filtering, the average squared error
between the desired signal and the multichannel filter output is minimized.
To minimize the mean square error, the crosscorrelation functions or
crosspower spectra between the input channels and the desired signal are
required. In maximum-likelihood multichannel filtering, the average squared
output from the multichannel filter set is minimized subject to
signal-preservation constraints which place some suitably-chosen frequency
response on the signal. For maximum-likelihood multichannel filtering,
unlike Wiener-Kolmogorov filtering, only the direction of the signal needs
to be specified, but not the signal-to-noise ratio.


[FIGURE I-2. SCHEMATIC DIAGRAM OF MULTICHANNEL FILTERING]


Multichannel filtering can be employed with fixed or time-varying
filter sets. When the filters are updated as new data inputs enter the
multichannel processor, the process is called adaptive filtering.
Adaptive-filtering algorithms with significant computational advantages
over fixed multichannel filtering are available. When the inputs to the
multichannel processor are time-stationary (in the wide sense), these
algorithms yield filter sets which converge in the mean to the
corresponding fixed multichannel filter sets. After adaptive filter sets
reach the vicinity of the corresponding fixed filter sets, they fluctuate
about the fixed-filter solution in the presence of time-stationary data:
the adaptive filters converge in the mean in the sense that the average
position of the fluctuating adaptive filters is identical to the
fixed-filter solution. When the statistics of the data entering the
multichannel processor slowly change with time, adaptive filtering can
react to the changes in a semi-continuous manner. If fixed filtering is
used in this situation, newly-designed filters change in a more abrupt
fashion. When, as in this case, the statistics of the data shift with time,
the adaptive-filter solution lags behind the fixed-filter solution
corresponding to the instantaneous statistics. The extent of the lag can be
controlled by changing the adaptation rate. The choice of an adaptation
rate involves a tradeoff between misadjustment (higher-than-optimum error
or power due to the adaptive-filter fluctuations) and the lag behind the
optimum instantaneous fixed-filter solution. A different kind of lag occurs
when fixed filter sets are periodically redesigned: statistics must be
accumulated over a design interval so that, as a result, the fixed-filter
solution cannot be implemented until the next design interval.

In the conventional technique of array processing, simple time delays
or phase shifts are applied to the input channels before summing to generate
the beam output. Optimum multichannel filtering introduces considerable
new flexibility into the beamforming process. Since it is possible to weight
the input channels differently, channels with higher signal-to-noise ratios
can be emphasized at the expense of noisier channels. When well-instrumented
arrays are utilized, this capability is generally of minor importance.
A far more consequential feature of adaptive filtering is the ability to form
array antenna patterns which optimally pass a signal while simultaneously
rejecting propagating noise. Deep nulls can be aimed toward off-beam
noise sources. When strong off-azimuth noise sources are present, the
creation of such nulls is an automatic result of the optimality of the
multichannel processor. The conventional time-shift-and-sum or phase-shift
processor, in contrast, has a beam pattern determined solely by the steer
direction and the array geometry.

In most cases, the potential improvement of optimum multichannel
filtering relative to beamsteering is determined by the coherence of the
noise field across an array: the greater the similarity of the noise field
from channel to channel, the greater is the optimum-multichannel-filter
improvement over beamsteering. When, on the other hand, noise is completely
uncorrelated between sensors and identical signal and noise power
levels are encountered at all array sites, there is no potential for
improvement: in this case, the optimum filter set is a beamsteer processor.
The decision to employ or not to employ an optimum-filter technique of
processing depends critically on measurements of the noise field at any
given array. Once these measurements are available, the additional cost of
implementing an optimum-filter system can be quantitatively weighed against
the advantages of greater noise suppression relative to the conventional
beamsteer processing technique.
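The statement that uncorrelated, equal-power noise leaves nothing to gain over beamsteering can be checked numerically. The sketch below is an illustration under simplifying assumptions, not a procedure taken from this report: it uses a single weight per channel (zero lag only), a hypothetical 3-channel noise covariance matrix `R`, and the standard closed-form solution w = R⁻¹1 / (1ᵀR⁻¹1) for minimizing the output power wᵀRw subject to the weights summing to one. All function names are illustrative.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def constrained_weights(R):
    """Weights minimizing w^T R w subject to sum(w) = 1."""
    u = solve(R, [1.0] * len(R))  # u = R^{-1} 1
    total = sum(u)
    return [v / total for v in u]

def output_power(R, w):
    """Noise output power w^T R w."""
    n = len(w)
    return sum(w[i] * R[i][j] * w[j] for i in range(n) for j in range(n))

M = 3
beamsteer = [1.0 / M] * M

# Uncorrelated noise with equal power on every channel.
R_white = [[1.0 if i == j else 0.0 for j in range(M)] for i in range(M)]
w_white = constrained_weights(R_white)

# Hypothetical coherent case: channels 0 and 1 share most of their noise.
R_coherent = [[1.0, 0.9, 0.0],
              [0.9, 1.0, 0.0],
              [0.0, 0.0, 1.0]]
w_coh = constrained_weights(R_coherent)

print(w_white)
print(output_power(R_coherent, beamsteer), output_power(R_coherent, w_coh))
```

For the uncorrelated case the constrained optimum coincides with the beamsteer weights 1/M; for the coherent case the constrained weights give lower noise output power than beamsteering.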

D.  DESCRIPTION OF THE MAXIMUM-LIKELIHOOD ADAPTIVE FILTER ALGORITHM

The adaptive-filter output y(t) at time t is formed by applying a
convolution filter to each channel and summing the outputs of all channels:

$$ y(t) = \sum_{i=1}^{M} \sum_{j=-N}^{N} a_i(j)\, x_i(t-j) $$

where $a_i(j)$ is the filter weight for the i-th channel at a lag of j sample
points, $x_i(t-j)$ is the value of channel i at time t-j, M is the number
of channels, and 2N+1 is the total length of the filter in points. Prior to
forming the filter output, each channel is time-shifted to time-align energy
arriving from the desired steer direction.
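A minimal sketch of this double sum is given below. The storage layout (weights in `a[i][j + N]`, samples in `x[i][k]`) and the function name `filter_output` are assumptions made for illustration; the channels are taken to be already time-shifted toward the steer direction.

```python
def filter_output(a, x, t, N):
    """Return y(t) = sum_i sum_{j=-N..N} a_i(j) * x_i(t-j).

    a[i][j + N] holds the weight a_i(j); x[i][k] holds sample k of
    channel i, already time-aligned for the steer direction.
    """
    return sum(
        a[i][j + N] * x[i][t - j]
        for i in range(len(a))
        for j in range(-N, N + 1)
    )

M, N = 3, 1
# Beamsteer-like weights: 1/M at lag 0 and zero at the other lags.
a = [[0.0, 1.0 / M, 0.0] for _ in range(M)]
x = [[1.0, 2.0, 3.0, 4.0],
     [2.0, 2.0, 2.0, 2.0],
     [0.0, 1.0, 4.0, 9.0]]
y = filter_output(a, x, t=2, N=N)
print(y)
```

With these weights the output reduces to the channel average at time t, which is the beamsteer output.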

The adaptive filter weights are updated by the following algorithm:

$$ a_i^{\mathrm{new}}(j) = a_i^{\mathrm{old}}(j) + \lambda(t)\, y(t)\, [\bar{x}(t-j) - x_i(t-j)] $$

where

$$ \bar{x}(t-j) = \frac{1}{M} \sum_{i=1}^{M} x_i(t-j) $$

and $\lambda(t)$ is the convergence parameter at time t. This update algorithm
incorporates the maximum-likelihood constraints.

The  convergence  parameter  \(t)  is  calculated  by  the  formula 

2K 


A(t) 


M 


(2N+1)  E  p.(t) 


i=l 


where  Ks  is  an  input  parameter,  and  P.(t)  is  a  moving  power  average  for 
the  i-th  channel.  P.(t)  is  computed  by  the  formula: 


P.(t)  =  (1  -  U) 


x(t)  -  x.(t) 


+  ^P.(t-l) 


t  >  1 


where  H  is  an  input  parameter.  P.(0)  is  aero,  and  several  values  of  P.(t) 
are  computed  before  the  filter  is  allowed  to  vary.  1 
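A minimal sketch of one update step follows, assuming the update rule, convergence parameter, and moving power average take the forms given above. The function names, the data, and the values chosen for $K_s$ and $\mu$ are hypothetical. For brevity the power average is advanced only once here, whereas the report computes several values before letting the filter vary.

```python
def update_power(P, x_t, mu):
    """P_i(t) = (1 - mu) * (xbar(t) - x_i(t))**2 + mu * P_i(t - 1)."""
    xbar = sum(x_t) / len(x_t)
    return [(1.0 - mu) * (xbar - xi) ** 2 + mu * Pi for xi, Pi in zip(x_t, P)]


def convergence_parameter(P, Ks, N):
    """lambda(t) = 2 * Ks / ((2N + 1) * sum_i P_i(t))."""
    return 2.0 * Ks / ((2 * N + 1) * sum(P))


def update_weights(a, x, t, N, lam):
    """a_i(j) <- a_i(j) + lam * y(t) * (xbar(t - j) - x_i(t - j))."""
    M = len(a)
    lags = range(-N, N + 1)
    y = sum(a[i][j + N] * x[i][t - j] for i in range(M) for j in lags)
    new_a = [row[:] for row in a]
    for j in lags:
        xbar = sum(x[i][t - j] for i in range(M)) / M
        for i in range(M):
            new_a[i][j + N] += lam * y * (xbar - x[i][t - j])
    return new_a, y


# Hypothetical data and parameters (not taken from the report).
M, N, t = 3, 1, 2
x = [[1.0, 2.0, 3.0, 4.0, 5.0],
     [2.0, 2.0, 2.0, 2.0, 2.0],
     [0.0, 1.0, 0.0, 1.0, 0.0]]
a = [[0.0, 1.0 / M, 0.0] for _ in range(M)]          # beamsteer start
P = update_power([0.0] * M, [x[i][t] for i in range(M)], mu=0.9)
lam = convergence_parameter(P, Ks=0.05, N=N)
a_new, y = update_weights(a, x, t, N, lam)

# Sum of the weights across channels at each lag; the update leaves it unchanged.
lag_sums = [sum(a_new[i][k] for i in range(M)) for k in range(2 * N + 1)]
```

Because the correction at each lag is proportional to $\bar{x} - x_i$, it sums to zero across channels, so the weights at zero lag still sum to one and the weights at every other lag still sum to zero.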


1-8 


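E. GEOMETRICAL INTERPRETATION OF THE ABF ALGORITHM

In vector form, the adaptive-beamforming filter update equation may be written

$$ A^{new} = A^{old} - \lambda(t)\, X^T A^{old}\, (X - \bar{X}), $$

where the superscript T denotes transposition, and where the filter weight vector $A$, the data vector $X$, and the beamsteer output vector $\bar{X}$ are, respectively,

$$ A = [\,a_1(-N), \ldots, a_M(-N), \ \ldots, \ a_1(0), \ldots, a_M(0), \ \ldots, \ a_1(N), \ldots, a_M(N)\,]^T, $$

$$ X = [\,x_1(t+N), \ldots, x_M(t+N), \ \ldots, \ x_1(t), \ldots, x_M(t), \ \ldots, \ x_1(t-N), \ldots, x_M(t-N)\,]^T, $$

and

$$ \bar{X} = [\,\bar{x}(t+N), \ldots, \bar{x}(t+N), \ \ldots, \ \bar{x}(t-N), \ldots, \bar{x}(t-N)\,]^T. $$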

The objective of maximum-likelihood adaptive beamforming is to reduce the average squared filter output

$$ \overline{y^2(t)} = \overline{(A^T X)(X^T A)} = A^T\, \overline{X X^T}\, A $$


subject to a set of signal-preservation constraints on the filter vector $A$. After preshifting the input channels to time-align energy from the look direction, these constraints can be written

$$ \sum_{i=1}^{M} a_i(j) = d(j) \qquad (j = -N, \ldots, -1, 0, 1, \ldots, N), $$

where the constants $d(j)$ specify a convolution filter having the desired frequency response on a signal from the steer direction. When such a signal $s(t)$ appears in identical form on all channels, the signal output from the beamformer is

$$ \sum_{i=1}^{M} \sum_{j=-N}^{N} a_i(j)\, s(t-j) = \sum_{j=-N}^{N} d(j)\, s(t-j). $$

For the adaptive beamforming employed in this study, a white frequency response is specified by setting

$$ d(j) = \delta_{j0} \qquad (j = -N, \ldots, -1, 0, 1, \ldots, N), $$

where $\delta_{j0}$ is the Kronecker delta operator

$$ \delta_{j0} = \begin{cases} 1 & \text{if } j = 0 \\ 0 & \text{if } j \ne 0. \end{cases} $$

To reduce the average squared filter output $\overline{y^2(t)}$, the method of steepest descent (with two modifications) is used. In the unmodified form of the method of steepest descent, the filter vector $A$ moves in the direction opposite to the gradient of the average squared filter output $\overline{y^2(t)}$:

$$ A^{new} - A^{old} = -c\, \nabla (A^T\, \overline{X X^T}\, A) = -2c\, \overline{X X^T}\, A. $$

The first modification is to replace the crosscorrelation matrix $\overline{X X^T}$ with the rank-one matrix $X X^T$ formed from the instantaneous vector $X$ at time $t$:

$$ A^{new} - A^{old} = -2c\, X (X^T A) = -2c\, X\, y(t). $$


This modification of Widrow (Widrow, 1966) approximates, in effect, the crosscorrelation-matrix time averaging through successive applications of the filter update algorithm. The approximation becomes increasingly accurate as the rate of change of the filter vector $A$ is slowed by reducing the scalar convergence parameter $c$. The filter vector $A$ converges in the mean under suitably prescribed conditions (Daniell, 1968) to the vector obtained by using the crosscorrelation matrix $\overline{X X^T}$ in the update equation. Ultimately the filter vector $A$ oscillates about its mean. The size of the oscillations can be controlled by varying the parameter $c$. The reason for the Widrow modification is a reduction in the computational operations required for the filter update from a number proportional to the square of the dimension of the vectors $X$ and $A$ to a number linearly proportional to their dimension.


The second modification of the steepest-descent method is to alter the direction of the vector $-c\nabla[y^2(t)]$ so that the ensuing update vector $(A^{new} - A^{old})$ is the vector nearest to $-c\nabla[y^2(t)]$ which satisfies the constraints on the filter update vector. Since the sum across channels

$$ \sum_{i=1}^{M} a_i(j) $$

of the filter vector $A$ is a fixed value $d(j)$ at any lag value $j$, the filter update vector $(A^{new} - A^{old})$ must sum to zero at each lag:

$$ \sum_{i=1}^{M} \left[ a_i^{new}(j) - a_i^{old}(j) \right] = 0 \qquad (j = -N, \ldots, -1, 0, 1, \ldots, N). $$


The filter update vector must be perpendicular to each of the (2N+1) unit vectors $U_k$ specified by their components

$$ [U_k]_i(j) = \frac{\delta_{jk}}{\sqrt{M}} \qquad (i = 1, \ldots, M; \quad k = -N, \ldots, -1, 0, 1, \ldots, N), $$

where $\delta_{jk}$ is the Kronecker delta operator. Each of the vectors $U_k$ is zero except in the k-th lag position, where all components are equal to $1/\sqrt{M}$. In vector form, the filter update vector must satisfy the (2N+1) constraints

$$ U_k^T (A^{new} - A^{old}) = 0 \qquad (k = -N, \ldots, -1, 0, 1, \ldots, N). $$


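The vector $\bar{X}$, since it has identical components at any lag value $j$, is a linear combination of the vectors $U_k$ and is perpendicular to all possible update vectors satisfying the constraint conditions. The vector $(X - \bar{X})$, on the other hand, satisfies the constraint conditions. Writing $x_i(j)$ for the component of $X$ on channel $i$ at lag $j$,

$$ U_k^T (X - \bar{X}) = \frac{1}{\sqrt{M}} \sum_{i=1}^{M} \sum_{j=-N}^{N} \delta_{jk}\, [x_i(j) - \bar{x}(j)] = \frac{1}{\sqrt{M}} \sum_{i=1}^{M} [x_i(k) - \bar{x}(k)] $$

$$ = \frac{1}{\sqrt{M}} \left[ \sum_{i=1}^{M} x_i(k) - M\bar{x}(k) \right] = \frac{1}{\sqrt{M}} \left[ M\bar{x}(k) - M\bar{x}(k) \right] = 0. $$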

Thus the vector $X$ can be resolved into two mutually orthogonal components: $\bar{X}$ (perpendicular to the constraint space for the filter update vector) and $X - \bar{X}$ (lying within the constraint space). The negative $-c\nabla[y^2(t)]$ of the scaled gradient of $y^2(t)$ is a scalar multiple $-[2cy(t)]X$ of the vector $X$. The nearest point to $-c\nabla[y^2(t)]$ on the constraint space is the vector $-[2cy(t)](X - \bar{X})$ formed by subtracting the component $-[2cy(t)]\bar{X}$ perpendicular to the constraint space from the scaled negative gradient vector $-[2cy(t)]X$. The final form of the filter update equation is, therefore,

$$ A^{new} = A^{old} - 2c\, y(t)\, (X - \bar{X}). $$

This situation is illustrated in Figure I-3. The resultant filter update vector is the projection of $-c\nabla[y^2(t)]$ onto the constraint space.
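The orthogonality relations used in this argument can be checked numerically. The sketch below assumes the vector ordering given earlier in this subsection (lags from $t+N$ down to $t-N$, with channels $1$ to $M$ inside each lag) and unit constraint vectors with entries $1/\sqrt{M}$. The helper names and the data are hypothetical.

```python
import math


def stack(x, t, N):
    """Data vector X: lags ordered t+N ... t-N, channels 1..M within each lag."""
    M = len(x)
    return [x[i][t - j] for j in range(-N, N + 1) for i in range(M)]


def beamsteer_vector(X, M):
    """X-bar: every component replaced by the channel mean at its lag."""
    out = []
    for k in range(0, len(X), M):
        mean = sum(X[k:k + M]) / M
        out.extend([mean] * M)
    return out


def dot(u, v):
    return sum(p * q for p, q in zip(u, v))


# Hypothetical three-channel data, filter length 2N + 1 = 3.
x = [[1.0, 2.0, 3.0, 4.0, 5.0],
     [2.0, 2.0, 2.0, 2.0, 2.0],
     [0.0, 1.0, 0.0, 1.0, 0.0]]
M, N, t = 3, 1, 2

X = stack(x, t, N)
Xbar = beamsteer_vector(X, M)
diff = [p - q for p, q in zip(X, Xbar)]           # X - Xbar

# Unit constraint vectors U_k: 1/sqrt(M) in the k-th lag block, zero elsewhere.
U = [[1.0 / math.sqrt(M) if k * M <= n < (k + 1) * M else 0.0
      for n in range(len(X))]
     for k in range(2 * N + 1)]

ortho_components = dot(Xbar, diff)                # expected to vanish
constraint_products = [dot(Uk, diff) for Uk in U]  # each expected to vanish
```

Both results are zero to rounding error, which is the numerical counterpart of the statement that $X - \bar{X}$ lies in the constraint space while $\bar{X}$ is perpendicular to it.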


F.  A  SIMPLE  ILLUSTRATIVE  EXAMPLE 

Suppose that three channels are input to an adaptive beamformer with one lag per channel. Signals are defined to be simultaneous spikes on all three channels. Noise, on the other hand, appears in the form of unit-amplitude spikes on the second channel. When signal and noise occur, they are characterized by the respective vectors

$$ S = \begin{bmatrix} x_1(t) = s \\ x_2(t) = s \\ x_3(t) = s \end{bmatrix} \qquad \text{and} \qquad N = \begin{bmatrix} x_1(t) = 0 \\ x_2(t) = 1 \\ x_3(t) = 0 \end{bmatrix}. $$

Figure I-4 depicts time series inputs for this simple example. The adaptive filter set is initialized with beamsteer weights:

$$ a_1(0) = 1/3, \qquad a_2(0) = 1/3, \qquad a_3(0) = 1/3. $$

An optimum filter set is

$$ a_1(0) = 1/2, \qquad a_2(0) = 0, \qquad a_3(0) = 1/2. $$


FIGURE I-4

A SIMULATED DISPLAY OF THE TIME SERIES INPUT
CORRESPONDING TO THE ILLUSTRATIVE EXAMPLE
(a) Signal Model; (b) Noise Model (Channels 1 through 3 in each panel)


Whenever a noise spike first appears on channel 2, the filter vector $A$ is updated according to the equation

$$ A^{new} = A^{old} - 2c\, y(t)\, (N - \bar{N}) = \begin{bmatrix} 1/3 \\ 1/3 \\ 1/3 \end{bmatrix} - \frac{2c}{3} \begin{bmatrix} -1/3 \\ 2/3 \\ -1/3 \end{bmatrix} = \begin{bmatrix} 1/3 + 2c/9 \\ 1/3 - 4c/9 \\ 1/3 + 2c/9 \end{bmatrix}. $$

The adaptive update equation shifts its weighting from channel 2 to channels 1 and 3. The second time the noise spike appears on channel 2, the filter output is

$$ y(t) = \frac{1}{3} \left( 1 - \frac{4}{3} c \right). $$

When the (k+1)-st noise spike appears on channel 2, the output of the adaptive beamformer is

$$ y(t) = \frac{1}{3} \left( 1 - \frac{4}{3} c \right)^k. $$

Each time a new noise spike appears on channel 2, the adaptive filter output is $(1 - 4c/3)$ times its previous value. After each new noise spike, the difference between the optimum filter vector and the old filter vector is reduced by the same factor $(1 - 4c/3)$. In this simple example where the noise crosscorrelation matrix $\overline{N N^T}$ has a single non-zero eigenvalue, the average squared filtered noise output $\overline{y^2(t)}$ is minimized when $c = 3/4$. In this case, noise is completely eliminated starting with the second noise spike on channel 2. When $0 < c < 3/4$, the noise output always has the same sign as the spike on channel 2. When $c > 3/4$, however, the noise output alternately differs and agrees in sign with the input spike on channel 2. The magnitude of the output spike always diminishes as long as $0 < c < 3/2$. When $c > 3/2$, on the other hand, the adaptive filter vector diverges and the ABF output increases in magnitude with the arrival of each new noise spike.


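In this idealized example, the signal does not affect the filter update since

$$ S - \bar{S} = \begin{bmatrix} s \\ s \\ s \end{bmatrix} - \begin{bmatrix} s \\ s \\ s \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}. $$

In the event that a signal spike appears simultaneously on all channels, the output from the beamformer is $s$ since the filter weights sum to one.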
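The behavior described in this example is easy to reproduce. The following sketch applies the constrained update $A^{new} = A^{old} - 2c\,y(t)\,(N - \bar{N})$ to repeated unit noise spikes on channel 2, starting from beamsteer weights. The function name `simulate` and the chosen values of $c$ are assumptions made for illustration.

```python
def simulate(c, spikes):
    """Apply the constrained update to repeated unit noise spikes on channel 2.

    Returns the beamformer output at each spike and the final weight vector.
    """
    a = [1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0]        # beamsteer initialization
    noise = [0.0, 1.0, 0.0]                      # unit spike on channel 2
    nbar = sum(noise) / len(noise)               # beamsteer output of the noise
    outputs = []
    for _ in range(spikes):
        y = sum(ai * ni for ai, ni in zip(a, noise))
        outputs.append(y)
        a = [ai - 2.0 * c * y * (ni - nbar) for ai, ni in zip(a, noise)]
    return outputs, a


# Slow convergence: the output decays geometrically by the factor (1 - 4c/3).
outputs_slow, a_slow = simulate(c=0.3, spikes=5)

# c = 3/4: the optimum weights (1/2, 0, 1/2) are reached after a single spike.
outputs_fast, a_fast = simulate(c=0.75, spikes=3)

# c > 3/2: the output grows in magnitude from spike to spike.
outputs_div, _ = simulate(c=1.6, spikes=4)
```

In every case the weights continue to sum to one, so a signal spike common to all three channels passes through with amplitude $s$ regardless of how far the adaptation has progressed.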

G. ORGANIZATION OF REPORT

Section  II  describes  the  problem  of  floating  means  in  ALPA  data,  its 
effect  on  filter  performance,  and  the  techniques  used  to  eliminate  it.  Simple 
subtraction  of  the  channel  means  did  not  adequately  solve  the  problem.  This 

section  is  intended  for  data  analysts  and  programmers  who  must  deal  with 
means  in  the  input  data. 

Section III examines the effect of roundoff error in an integer-arithmetic implementation of the filter update equation. Simple approximations for the average angle of error in the vectors $(X - \bar{X})$ and $(A^{new} - A^{old})$ are presented. The angle of error in $(X - \bar{X})$ is affected by the number of bits used to represent the data values $x_i(t-j)$, whereas the angle of error in $(A^{new} - A^{old})$ is also influenced by the number of bits employed to represent the filter weights $a_i(j)$. The simple approximations given for the angular error in $(X - \bar{X})$ and $(A^{new} - A^{old})$ are probably adequate to specify the data-value and filter-weight representations to the nearest four bits when directional error limits are placed on the vectors $(X - \bar{X})$ and $(A^{new} - A^{old})$. This section is intended primarily for digital design engineers and programmers who may need to consider roundoff error when implementing an integer-arithmetic adaptive-filtering system.

In Section IV, the subject of investigation is the ability of adaptive multichannel filtering to provide signal-to-noise ratio improvement relative to beamsteering in the presence of background noise. Signal degradation and noise reduction measurements from real data are combined to yield the associated signal-to-noise gain. The convergence rates where the highest signal-to-noise gains occur are ascertained in this way. This section is intended to assist those persons evaluating the potential usefulness of an adaptive-filtering system for processing ALPA data.

Section  V  studies  the  effect  of  freezing  the  adaptive  filter  set.  Loss 
in  noise  reduction  is  determined  by  comparing  the  adaptive -filter  beam  out¬ 
put  when  the  filter  is  allowed  to  vary  with  the  adaptive -filter  beam  output 
when  the  filter  is  frozen.  Signal  degradation  is  remeasured  when  the  adap¬ 
tive  filter  set  is  frozen  in  order  to  estimate  the  resulting  signal-to-noise 
gain.  In  processing  the  data  samples  used  to  evaluate  the  filter-freeze  pro¬ 
cedure,  several  off-azimuth  events  are  also  run  through  the  adaptive  beam- 
former.  The  potential  improvement  in  interfering-event  situations  is  de¬ 
monstrated  as  a  byproduct  of  the  investigations  presented  in  this  section. 

The  results  of  this  section  are  intended  both  for  data  analysts  and  program¬ 
mers  interested  in  the  effect  of  the  filter-freeze  procedure  and  for  those 
persons  interested  in  the  effectiveness  of  the  adaptive -filter  algorithm  in 
processing  interfering  events. 

Section VI examines noise reduction achieved by adaptive filtering for different look directions and discusses variations in processing gain for the ten four-hour noise samples processed for this report. This section is intended to give some idea of the variability which can be expected in the performance of the adaptive-filtering process when background noise is present. This information should be useful in evaluating the potential improvements to be gained through adaptive filtering.

Section  VII  presents  the  conclusions  of  this  study.  For  those  with  in¬ 
sufficient  time  to  analyze  in  detail  the  results  of  the  individual  sections  in 
this  report,  this  section  gives  the  highlights  of  this  investigation. 


SECTION  II 


THE  PROBLEM  OF  FLOATING  MEANS 

A. FLOATING MEANS IN ALPA DATA

In  data  recorded  prior  to  the  summer  of  1972,  some  of  the  digitized 
traces  transmitted  from  ALPA  had  mean  levels  higher  than  1000  computer 
counts.  These  mean  levels  were  contrasted  with  RMS  levels  in  tens  of  com¬ 
puter  counts  (after  mean  removal).  Furthermore,  these  DC  levels  gradually 
changed  over  a  period  of  a  few  hours.  Subtraction  of  a  fixed  DC  level  was 
insufficient  to  reduce  the  mean  to  a  level  significantly  below  the  RMS  level 
of  any  given  trace  because  of  the  gradually  changing  mean  levels.  Even  the 
removal  of  an  exponentially  smoothed  running  mean  produced  similar  results 
because  of  a  time  lag  between  the  mean  computation  and  mean  removal. 

The  floating  means  in  ALPA  data  are  caused  by  drift  in  the  pream¬ 
plifiers  at  the  individual  sites  of  the  array.  In  the  summer  of  1972,  the  orig¬ 
inal  preamplifiers  were  replaced  by  new  preamplifiers  of  different  design. 

The  effect  of  this  substitution  was  to  reduce  the  floating  mean  levels  by  an 
order  of  magnitude:  while  mean  levels  with  the  original  equipment  could 
sometimes  reach  thousands  of  computer  counts,  mean  levels  were  reduced 
to  levels  expressed  in  hundreds  of  counts  (typically  between  100  and  200  com¬ 
puter  counts)  with  the  change  in  instrumentation. 

Although  the  problem  of  floating  means  has  been  considerably  amelio¬ 
rated  by  the  dramatic  reduction  in  mean  levels  at  ALPA,  the  current  DC  levels 
still  lie  above  the  channel  RMS  amplitude  levels  after  mean  removal.  The 
same  problem  is  evident  at  the  NORSAR  array.  In  fact,  the  floating-mean 
problem  may  be  a  general  problem  with  long-period  seismometers. 


Consequently, effective techniques for removing the floating DC levels appear to be essential in processing long-period data with adaptive-filtering algorithms.

On a quiet noise sample from day 203 of 1971, negative improvement from adaptive filtering (as measured in terms of noise reduction) was consistently obtained until the problem of floating means was studied and dealt with effectively. The chief symptom of the trouble was a particularly poor performance at frequencies below the frequency equal to the reciprocal of the filter length.


Subsection  B  presents  a  theoretical  study  of  the  effects  of  DC  bias  on 
maximum-likelihood  adaptive  filtering.  In  Subsection  B,  the  results  are  de¬ 
rived  using  the  fixed-filter  maximum-likelihood  multichannel-filter  design 
equations.  Since  the  adaptive -filter  solution  converges  in  the  mean  to  the 
fixed-filter  solution,  similar  if  not  precisely  identical  results  can  be  expected 
in  the  case  of  adaptive  filtering.  Empirical  verification  of  this  fact  can  be  de¬ 
rived  from  the  elimination  of  the  predicted  floating -mean  performance  symp¬ 
toms  upon  removal  of  the  floating  means  from  the  data. 


Subsection C describes the measures taken to eliminate problems associated with DC bias. A necessary remedy was the application of a prefilter with precisely zero response at 0 Hz. In addition, computational bias in the adaptive-filter computer program had to be reduced to the maximum extent possible.


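B. EFFECT OF DC LEVELS ON ADAPTIVE FILTERING IMPROVEMENT

The general design equation for a two-channel, (2N+1)-point maximum-likelihood multichannel filter set is

$$
\begin{bmatrix}
\begin{bmatrix} \phi_{11}(0) & \phi_{12}(0) & 1 \\ \phi_{21}(0) & \phi_{22}(0) & 1 \\ 1 & 1 & 0 \end{bmatrix} &
\begin{bmatrix} \phi_{11}(-1) & \phi_{12}(-1) & 0 \\ \phi_{21}(-1) & \phi_{22}(-1) & 0 \\ 0 & 0 & 0 \end{bmatrix} & \cdots &
\begin{bmatrix} \phi_{11}(-2N) & \phi_{12}(-2N) & 0 \\ \phi_{21}(-2N) & \phi_{22}(-2N) & 0 \\ 0 & 0 & 0 \end{bmatrix} \\
\begin{bmatrix} \phi_{11}(1) & \phi_{12}(1) & 0 \\ \phi_{21}(1) & \phi_{22}(1) & 0 \\ 0 & 0 & 0 \end{bmatrix} &
\begin{bmatrix} \phi_{11}(0) & \phi_{12}(0) & 1 \\ \phi_{21}(0) & \phi_{22}(0) & 1 \\ 1 & 1 & 0 \end{bmatrix} & \cdots &
\begin{bmatrix} \phi_{11}(1-2N) & \phi_{12}(1-2N) & 0 \\ \phi_{21}(1-2N) & \phi_{22}(1-2N) & 0 \\ 0 & 0 & 0 \end{bmatrix} \\
\vdots & \vdots & \ddots & \vdots \\
\begin{bmatrix} \phi_{11}(2N) & \phi_{12}(2N) & 0 \\ \phi_{21}(2N) & \phi_{22}(2N) & 0 \\ 0 & 0 & 0 \end{bmatrix} &
\begin{bmatrix} \phi_{11}(2N-1) & \phi_{12}(2N-1) & 0 \\ \phi_{21}(2N-1) & \phi_{22}(2N-1) & 0 \\ 0 & 0 & 0 \end{bmatrix} & \cdots &
\begin{bmatrix} \phi_{11}(0) & \phi_{12}(0) & 1 \\ \phi_{21}(0) & \phi_{22}(0) & 1 \\ 1 & 1 & 0 \end{bmatrix}
\end{bmatrix}
\begin{bmatrix} a_1(-N) \\ a_2(-N) \\ -\lambda(-N) \\ \vdots \\ a_1(0) \\ a_2(0) \\ -\lambda(0) \\ \vdots \\ a_1(N) \\ a_2(N) \\ -\lambda(N) \end{bmatrix}
=
\begin{bmatrix} 0 \\ 0 \\ 0 \\ \vdots \\ 0 \\ 0 \\ 1 \\ \vdots \\ 0 \\ 0 \\ 0 \end{bmatrix}
$$

where $\phi_{ij}(\tau)$ denotes the crosscorrelation function between channel $i$ and channel $j$ at time lag $\tau$, $a_j(\tau)$ denotes the filter point for channel $j$ at time lag $\tau$, and $\lambda(\tau)$ denotes the Lagrangian multiplier associated with the maximum-likelihood constraint condition imposed upon the filter weights for time lag $\tau$. The Lagrangian multiplier $\lambda(0)$ for zero lag is equal to the mean square error and noise power output of the filter set. A derivation of the design equation is given in part 2 of Subsection III-D.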

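Assume two traces each consist of a mean $m_i$ with white random noise of power $p_i$. Then

$$ \phi_{ij}(\tau) = m_i m_j \qquad \tau \ne 0 $$

$$ \phi_{ij}(0) = m_i m_j \qquad i \ne j $$

$$ \phi_{ii}(0) = m_i^2 + p_i. $$

Let

$$ P = \begin{bmatrix} p_1 & 0 & 1 \\ 0 & p_2 & 1 \\ 1 & 1 & 0 \end{bmatrix}, \qquad M = \begin{bmatrix} m_1 \\ m_2 \\ 0 \end{bmatrix}, \qquad O = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}, $$

and

$$ A(\tau) = \begin{bmatrix} a_1(\tau) \\ a_2(\tau) \\ -\lambda(\tau) \end{bmatrix}, \qquad \Delta(\tau) = \begin{bmatrix} 0 \\ 0 \\ \delta_{\tau 0} \end{bmatrix}, $$

where $\delta$ denotes the Kronecker delta.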


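The design equation for a symmetrical two-channel, (2N+1)-point maximum-likelihood filter set, for the noise field specified, reduces to

$$
\left\{
\begin{bmatrix} P & O & \cdots & O \\ O & P & \cdots & O \\ \vdots & \vdots & \ddots & \vdots \\ O & O & \cdots & P \end{bmatrix}
+
\begin{bmatrix} M \\ M \\ \vdots \\ M \end{bmatrix}
\begin{bmatrix} M^T & M^T & \cdots & M^T \end{bmatrix}
\right\}
\begin{bmatrix} A(-N) \\ \vdots \\ A(0) \\ \vdots \\ A(N) \end{bmatrix}
=
\begin{bmatrix} \Delta(-N) \\ \vdots \\ \Delta(0) \\ \vdots \\ \Delta(N) \end{bmatrix}.
$$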

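Let $\Pi$ denote the (2N+1) by (2N+1) block matrix with diagonal element $P$. Let $\mu$ denote the (2N+1) block column vector with element $M$. Then the total noise matrix is $\Phi = \Pi + \mu\mu^T$, and its inverse (by the Sherman-Morrison formula for a rank-one update) is

$$ \Phi^{-1} = \Pi^{-1} - \frac{\Pi^{-1} \mu \mu^T \Pi^{-1}}{1 + \mu^T \Pi^{-1} \mu}. $$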
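The quantities required are

$$ \Pi^{-1} = \begin{bmatrix} P^{-1} & O & \cdots & O \\ O & P^{-1} & \cdots & O \\ \vdots & \vdots & \ddots & \vdots \\ O & O & \cdots & P^{-1} \end{bmatrix}, \qquad \Pi^{-1}\mu = \begin{bmatrix} P^{-1}M \\ P^{-1}M \\ \vdots \\ P^{-1}M \end{bmatrix}, $$

$$ \mu^T \Pi^{-1} \mu = \begin{bmatrix} M^T & M^T & \cdots & M^T \end{bmatrix} \begin{bmatrix} P^{-1}M \\ P^{-1}M \\ \vdots \\ P^{-1}M \end{bmatrix} = (2N+1)\, M^T P^{-1} M, $$

$$ \Pi^{-1} \mu \mu^T \Pi^{-1} = \begin{bmatrix} P^{-1}M \\ \vdots \\ P^{-1}M \end{bmatrix} \begin{bmatrix} M^T P^{-1} & \cdots & M^T P^{-1} \end{bmatrix} = \begin{bmatrix} P^{-1}MM^TP^{-1} & \cdots & P^{-1}MM^TP^{-1} \\ \vdots & & \vdots \\ P^{-1}MM^TP^{-1} & \cdots & P^{-1}MM^TP^{-1} \end{bmatrix}, $$

with

$$ P^{-1} = \frac{1}{p_1 + p_2} \begin{bmatrix} 1 & -1 & p_2 \\ -1 & 1 & p_1 \\ p_2 & p_1 & -p_1 p_2 \end{bmatrix}, \qquad P^{-1}M = \frac{1}{p_1 + p_2} \begin{bmatrix} m_1 - m_2 \\ m_2 - m_1 \\ p_2 m_1 + p_1 m_2 \end{bmatrix}, $$

$$ M^T P^{-1} M = \frac{1}{p_1 + p_2} \begin{bmatrix} m_1 & m_2 & 0 \end{bmatrix} \begin{bmatrix} m_1 - m_2 \\ m_2 - m_1 \\ p_2 m_1 + p_1 m_2 \end{bmatrix} = \frac{(m_1 - m_2)^2}{p_1 + p_2}, $$

$$ P^{-1} M M^T P^{-1} = \frac{(m_1 - m_2)^2}{(p_1 + p_2)^2} \begin{bmatrix} 1 & -1 & \beta \\ -1 & 1 & -\beta \\ \beta & -\beta & \beta^2 \end{bmatrix}, $$

where

$$ \beta = \frac{p_2 m_1 + p_1 m_2}{m_1 - m_2}. $$

Then

$$ 1 + \mu^T \Pi^{-1} \mu = 1 + \frac{(2N+1)(m_1 - m_2)^2}{p_1 + p_2} $$

and

$$ \Phi^{-1} = \frac{1}{p_1 + p_2} \begin{bmatrix} (p_1+p_2)P^{-1} & O & \cdots & O \\ O & (p_1+p_2)P^{-1} & \cdots & O \\ \vdots & \vdots & \ddots & \vdots \\ O & O & \cdots & (p_1+p_2)P^{-1} \end{bmatrix} - \frac{(m_1 - m_2)^2}{(p_1 + p_2)\left[ (p_1 + p_2) + (2N+1)(m_1 - m_2)^2 \right]} \begin{bmatrix} E & E & \cdots & E \\ E & E & \cdots & E \\ \vdots & \vdots & & \vdots \\ E & E & \cdots & E \end{bmatrix}, $$

where

$$ E = \begin{bmatrix} 1 & -1 & \beta \\ -1 & 1 & -\beta \\ \beta & -\beta & \beta^2 \end{bmatrix}. $$

Equivalently,

$$ \Phi^{-1} = \frac{1}{p_1 + p_2} \left\{ \begin{bmatrix} D & O & \cdots & O \\ O & D & \cdots & O \\ \vdots & \vdots & \ddots & \vdots \\ O & O & \cdots & D \end{bmatrix} + \gamma \begin{bmatrix} E & \cdots & E \\ \vdots & & \vdots \\ E & \cdots & E \end{bmatrix} \right\}, $$

where

$$ D = \begin{bmatrix} 1 & -1 & p_2 \\ -1 & 1 & p_1 \\ p_2 & p_1 & -p_1 p_2 \end{bmatrix} \qquad \text{and} \qquad \gamma = \frac{-(m_1 - m_2)^2}{(p_1 + p_2) + (2N+1)(m_1 - m_2)^2}. $$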


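The solution of the design equation is therefore, for $\tau \ne 0$,

$$ \begin{bmatrix} a_1(\tau) \\ a_2(\tau) \\ -\lambda(\tau) \end{bmatrix} = \frac{1}{p_1 + p_2}\, \gamma E \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = \frac{1}{p_1 + p_2} \begin{bmatrix} \beta\gamma \\ -\beta\gamma \\ \beta^2 \gamma \end{bmatrix}, $$

and, for zero lag,

$$ \begin{bmatrix} a_1(0) \\ a_2(0) \\ -\lambda(0) \end{bmatrix} = \frac{1}{p_1 + p_2}\, (D + \gamma E) \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = \frac{1}{p_1 + p_2} \begin{bmatrix} p_2 + \beta\gamma \\ p_1 - \beta\gamma \\ -p_1 p_2 + \beta^2 \gamma \end{bmatrix}. $$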

The mean square error and noise power output are equal to $\lambda(0)$:

$$ \lambda(0) = \frac{p_1 p_2 - \beta^2 \gamma}{p_1 + p_2} = \frac{1}{p_1 + p_2} \left\{ p_1 p_2 + \frac{(p_2 m_1 + p_1 m_2)^2}{(p_1 + p_2) + (2N+1)(m_1 - m_2)^2} \right\}. $$

Under the assumption that the white noise levels are the same for both channels, $p_1 = p_2 = p$ and the mean square error is

$$ \lambda(0) = \frac{p}{2} \left\{ 1 + \frac{(m_1 + m_2)^2}{2p + (2N+1)(m_1 - m_2)^2} \right\}. $$

The noise output power for a beamsteer system applied to the same noise field is

$$ \frac{1}{4} \left[ (p_1 + p_2) + (m_1 + m_2)^2 \right] $$

or

$$ \frac{p}{2} \left\{ 1 + \frac{(m_1 + m_2)^2}{2p} \right\} $$

if $p_1 = p_2 = p$. Since the quantity $(2N+1)(m_1 - m_2)^2$ is always non-negative, the maximum-likelihood filter set reduces the noise output power to a level at least as low as the noise output power of a beamsteer system. Implicit in the design equations, however, is the assumption that "power" due to DC bias is as



undesirable  as  power  due  to  fluctuations  in  the  traces.  In  practice,  the  analyst 
easily  "filters  out"  DC  bias  when  visually  inspecting  a  seismic  trace. 
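The closed-form noise output power can be cross-checked against a direct solution of the bordered design equation. The sketch below assumes the noise model of this subsection (means $m_1$, $m_2$ plus white noise of powers $p_1$, $p_2$) and uses exact rational arithmetic. The function names and the numerical values are hypothetical; the values only loosely echo the situation described earlier, with means in the hundreds of counts and RMS levels in the tens.

```python
from fractions import Fraction


def ml_noise_power(p1, p2, m1, m2, N):
    """Closed-form lambda(0) for the means-plus-white-noise model."""
    denom = (p1 + p2) + (2 * N + 1) * (m1 - m2) ** 2
    return (p1 * p2 + (p2 * m1 + p1 * m2) ** 2 / denom) / (p1 + p2)


def beamsteer_noise_power(p1, p2, m1, m2):
    """Output power of a two-channel beamsteer (simple average)."""
    return ((p1 + p2) + (m1 + m2) ** 2) / 4


def solve(matrix, rhs):
    """Gauss-Jordan elimination in exact rational arithmetic."""
    n = len(rhs)
    aug = [[Fraction(v) for v in row] + [Fraction(b)]
           for row, b in zip(matrix, rhs)]
    for col in range(n):
        piv = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[piv] = aug[piv], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [aug[r][n] for r in range(n)]


def design_lambda0(p1, p2, m1, m2, N):
    """Solve the block design equation directly and return lambda(0)."""
    P = [[p1, 0, 1], [0, p2, 1], [1, 1, 0]]
    Mv = [m1, m2, 0]
    size = 3 * (2 * N + 1)
    # Every block contains M M^T; diagonal blocks add P.
    mat = [[Mv[r % 3] * Mv[c % 3] + (P[r % 3][c % 3] if r // 3 == c // 3 else 0)
            for c in range(size)] for r in range(size)]
    rhs = [0] * size
    rhs[3 * N + 2] = 1                     # constraint entry at zero lag
    return -solve(mat, rhs)[3 * N + 2]     # third entry of A(0) is -lambda(0)


# Hypothetical values: equal noise power of 100 (RMS 10 counts), unequal means.
p1 = p2 = Fraction(100)
m1, m2, N = Fraction(150), Fraction(100), 2
lam_closed = ml_noise_power(p1, p2, m1, m2, N)
lam_direct = design_lambda0(p1, p2, m1, m2, N)
bs_power = beamsteer_noise_power(p1, p2, m1, m2)
```

For these values the maximum-likelihood figure falls far below the beamsteer figure, but almost all of the difference is DC "power" that the filter has traded against fluctuation power. That trade is the effect the surrounding text warns about.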


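Some important effects occur as a result of the means in the traces. The filter weights are

$$ a_1(\tau \ne 0) = \frac{\beta\gamma}{p_1 + p_2}, \qquad a_2(\tau \ne 0) = \frac{-\beta\gamma}{p_1 + p_2}, $$

$$ a_1(0) = \frac{p_2}{p_1 + p_2} + \frac{\beta\gamma}{p_1 + p_2}, \qquad a_2(0) = \frac{p_1}{p_1 + p_2} - \frac{\beta\gamma}{p_1 + p_2}. $$

If $\alpha$ is set to $\beta\gamma/(p_1 + p_2)$ and $p_1 = p_2$, the frequency responses of the two channel filters are

$$ A_1(f) = \frac{1}{2} + \alpha \sum_{l=-N}^{N} e^{-i 2\pi f l \Delta t} $$

and

$$ A_2(f) = \frac{1}{2} - \alpha \sum_{l=-N}^{N} e^{-i 2\pi f l \Delta t}. $$

If $\sigma(f)$ is set to $\sum_{l=-N}^{N} e^{-i 2\pi f l \Delta t}$, the noise output power density of the maximum-likelihood filter set is

$$ \begin{bmatrix} A_1(f) & A_2(f) \end{bmatrix} \begin{bmatrix} P(f) & 0 \\ 0 & P(f) \end{bmatrix} \begin{bmatrix} A_1^*(f) \\ A_2^*(f) \end{bmatrix} = \left[ \tfrac{1}{4} + \alpha\sigma(f) + \alpha^2\sigma^2(f) \right] P(f) + \left[ \tfrac{1}{4} - \alpha\sigma(f) + \alpha^2\sigma^2(f) \right] P(f) $$

$$ = \tfrac{1}{4}\left[ P(f) + P(f) \right] + \alpha\sigma(f)\left[ P(f) - P(f) \right] + \alpha^2\sigma^2(f)\left[ P(f) + P(f) \right] = \frac{P(f)}{2}\left[ 1 + 4\alpha^2\sigma^2(f) \right] $$

as compared with $P(f)/2$ noise output power density for a beamsteer system. Thus, at any frequency other than DC, the beamsteer system reduces the noise level at least as much as the maximum-likelihood filter set. In fact

$$ \sigma(f) = \sum_{l=-N}^{N} e^{-i 2\pi f l \Delta t} = \frac{\sin\left[ (2N+1)\pi f \Delta t \right]}{\sin(\pi f \Delta t)} $$

and $\sigma(f) = 0$ when

$$ f = \frac{k}{\Delta t\,(2N+1)} \qquad (k = 1, 2, \ldots, 2N). $$

The first zero of this function occurs at $f = 1/[(2N+1)\Delta t]$, and the function climbs monotonically until it approaches the value $2N+1$ as the frequency approaches zero from the right. The quantity $(2N+1)\Delta t$ is the effective length of the filter. This result provides a possible explanation for the poor performance of the maximum-likelihood filter set at frequencies below the reciprocal of the filter length.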
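The properties claimed for $\sigma(f)$ can be verified numerically. In the sketch below, the filter half-length $N = 15$ (a 31-point filter) and the sample interval $\Delta t = 1$ are arbitrary choices made for illustration, and the function names are hypothetical.

```python
import cmath
import math


def sigma_sum(f, N, dt):
    """sigma(f) as the direct sum of exp(-i 2 pi f l dt) for l = -N..N."""
    return sum(cmath.exp(-2j * math.pi * f * l * dt) for l in range(-N, N + 1))


def sigma_closed(f, N, dt):
    """Closed form sin((2N+1) pi f dt) / sin(pi f dt), valid for f != 0."""
    return math.sin((2 * N + 1) * math.pi * f * dt) / math.sin(math.pi * f * dt)


N, dt = 15, 1.0                       # assumed values, not from the report
f_test = 0.013
direct = sigma_sum(f_test, N, dt)     # imaginary part cancels by symmetry
closed = sigma_closed(f_test, N, dt)

first_zero = 1.0 / ((2 * N + 1) * dt)  # reciprocal of the filter length
at_zero = sigma_sum(first_zero, N, dt)
near_dc = sigma_sum(1e-6, N, dt)       # approaches 2N + 1 = 31
```

Below `first_zero` the factor $4\alpha^2\sigma^2(f)$ is at its largest, which is consistent with the poor low-frequency performance noted in the text.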


C. TECHNIQUES USED TO ELIMINATE DC BIAS FROM THE DATA

The first step in removing DC bias from the traces was to apply single-channel prefilters to each of the site traces after rotation to the vertical trace. A 31-point convolution filter was used for this purpose. The weights for this filter are shown in Table II-1. Data points are scaled by $2^{-15}$ before exiting from the convolution filter microcode which implements the filter. The frequency response (after scaling) is shown in Figure II-1. The fact that the filter weights sum to zero guarantees that the response at DC is exactly zero, or $-\infty$ in dB.




The  filter  of  Table  II- 1  and  Figure  II- 1  was  obtained  from  a  computer 
program  which  minimizes,  for  a  discrete  convolution  filter  of  specified  length, 
the  weighted  mean  square  error  (integrated  over  the  Nyquist  band)  between  the 
desired  frequency  response  and  the  convolution-filter  frequency  response. 

After  the  initial  run,  several  iterations  with  altered  frequency  weightings  and 
desired  responses  were  made  to  reduce  the  ripple  in  the  filter  response.  Next 
the  desired  response  at  0  Hz  was  modified  so  as  to  reduce  the  amplitude  at  DC. 
After a response below -100 dB was achieved at 0 Hz, the filter coefficients were rounded to the nearest 1/32,768. Finally, the zero-lag weight of the convolution filter was adjusted by a multiple of 1/32,768 so that the sum of the filter coefficients was zero (yielding a response of $-\infty$ dB at DC).
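The last two steps (rounding to multiples of 1/32,768 and correcting the zero-lag weight) can be sketched as follows. The seven weights used here are hypothetical and much shorter than the 31-point filter of Table II-1; they are chosen only so that the rounded sum is visibly non-zero before the correction.

```python
def quantize_zero_dc(weights, bits=15):
    """Round weights to integer multiples of 2**-bits, then adjust the
    center (zero-lag) weight so that the integer weights sum to exactly zero."""
    scale = 1 << bits
    q = [round(w * scale) for w in weights]
    q[len(q) // 2] -= sum(q)           # absorb the residual in the center tap
    return q


# Hypothetical symmetric high-pass weights (not the values of Table II-1).
weights = [-0.05, -0.12, -0.20, 0.75, -0.20, -0.12, -0.05]
q = quantize_zero_dc(weights)

dc_gain = sum(q)                       # integer sum, so zero is exact
center_shift = q[3] - round(weights[3] * (1 << 15))
```

Working with the integer numerators rather than floating-point values makes the zero sum exact. In the report the DC response was already below -100 dB before rounding, so the correction to the zero-lag weight there would have been far smaller than in this toy case.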

When  the  program  changes  to  implement  this  filter  were  made,  dif¬ 
ficulties  persisted.  A  detailed  study  of  the  adaptive-filtering  subroutine  show¬ 
ed  that  1/2  count  of  negative  bias  per  channel  was  being  introduced  by  the  con¬ 
volution-filter  microcode  because  of  truncation.  The  subroutine  was  rewritten 
to  compensate  for  this  bias  in  the  beam  output  trace  and  to  round  intermediate 
results  wherever  possible.  After  these  modifications,  the  DC  bias  problems 
were  eliminated. 
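The half-count bias can be illustrated with integer arithmetic. The sketch assumes that truncation behaves like an arithmetic right shift (rounding toward minus infinity) when a product is scaled by $2^{-15}$; the actual microcode behavior is not documented here, so this is a model of the mechanism rather than a description of it. The function names are hypothetical.

```python
def scale_truncate(v, bits=15):
    """Scale by 2**-bits with truncation (arithmetic shift, rounds down)."""
    return v >> bits


def scale_round(v, bits=15):
    """Scale by 2**-bits with rounding: add half an output count, then shift."""
    return (v + (1 << (bits - 1))) >> bits


def mean_error(scaler, values, bits=15):
    """Average of (scaled integer result - exact scaled value)."""
    scale = float(1 << bits)
    return sum(scaler(v, bits) - v / scale for v in values) / len(values)


# Unscaled values spanning whole output counts on both sides of zero.
values = range(-(1 << 17), 1 << 17)

bias_truncate = mean_error(scale_truncate, values)   # close to -1/2 count
bias_round = mean_error(scale_round, values)         # close to zero
```

A bias of about half a count per channel is small compared with the data, but it is a pure DC term, which is exactly the component the prefilter was designed to remove.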


SECTION  III 


EFFECT  OF  ROUNDOFF  ERROR  IN  THE  FILTER  UPDATE  EQUATION 

A.  DISCUSSION 

In  the  adaptive-filtering  investigations  conducted  for  this  study,  the 
bulk  of  the  adaptive -filter  update  calculations  and  the  adaptive  filter  output 
computations  were  performed  using  integer  arithmetic.  The  reason  for  in¬ 
teger  arithmetic  was  the  desirability  of  operating  in  an  on-line  mode  on  the 
IBM  360/40  computers  at  SDAC  using  their  special  high-speed  microcode 
operations.  In  the  construction  of  hardware  specifically  tailored  to  imple¬ 
ment  adaptive  beamforming,  economic  considerations  seem  to  dictate  that 
integer  arithmetic  be  used.  For  this  reason,  the  experience  gained  during 
this  study  in  the  problems  arising  from  integer -arithmetic  roundoff  error 

may  well  be  applicable  to  a  variety  of  economically  practical  adaptive -filtering 
systems. 

The  objective  of  this  section  is  to  obtain  quantitative  estimates  of  the 
directional  error  in  the  filter  update  vector  as  a  function  of  the  limited  num¬ 
ber  of  parameters  controlling  the  directional  error.  Some  of  these  para¬ 
meters,  namely  the  number  of  bits  used  to  represent  the  input  data  points 
and  the  filter-weight  coefficients,  are  subject  to  direct  human  control.  Other 
parameters  depend  either  directly  or  indirectly  on  the  characteristics  of  the 
data  at  any  given  array.  Once  directional-error  tolerances  can  be  specified 
with  sufficient  precision  to  avoid  significant  degradation  of  adaptive -filter 
performance,  noise  field  measurements  at  an  array,  together  with  the  direc¬ 
tional-error  approximations  given  in  this  section,  should  provide  useful  esti¬ 
mates  of  the  number  of  bits  needed  for  the  input  data  points  and  the  filter- 


weight  coefficients.  Even  estimates  accurate  only  to  the  nearest  four  bits 
are  valuable:  it  is  extremely  useful,  in  designing  adaptive-filtering  hardware, 
to  know  whether  12  bits,  16  bits,  20  bits,  24  bits,  etc,  are  needed  to  repre- 
sent  adequately  the  data  and  adaptive -filter  coefficients. 

In this section, the computational error will be traced through the series of integer-arithmetic operations performed in updating the adaptive-filter weights. At this point, these operations will be reviewed. The filter update procedure is a vector operation which can be written compactly in the vector equation

$$ A^{new} = A^{old} + \frac{2K_s\, X^T A^{old}\, (\bar{X} - X)}{(X - \bar{X})^T (X - \bar{X})} $$

which describes the particular algorithm used in this study. The term $K_s$ is a scalar quantity called the convergence factor. It controls the adaptation rate of the adaptive-filtering process. The dot product $X^T A^{old}$ denotes the series of multiply-and-add operations which yield the adaptive beamformer output $y(t)$. The superscript T denotes vector transposition. The vectors $A^{new}$, $A^{old}$, $X$, and $\bar{X}$ (as well as their constituent components) were defined previously in Subsections I-D and I-E. In the algorithm actually employed, the denominator $(X - \bar{X})^T (X - \bar{X})$ is approximated by

$$ (2N+1) \sum_{i=1}^{M} P_i(t) $$

where $P_i(t)$ is a moving power average of the difference between the i-th channel and the beamsteer output (see page I-8). In this approximation, $M$ is the number of input channels and $(2N+1)$ is the total number of filter weights per channel.


Since the denominator $(\bar{X} - X)^T (\bar{X} - X)$ and the convergence factor $K_s$
are positive scalar quantities, the direction of motion of the filter update
vector $(A^{new} - A^{old})$ is determined by the sign of the adaptive filter output
$y(t) = X^T A^{old}$ (a scalar) and the vector $(\bar{X} - X)$. Directional error in the
vector $(\bar{X} - X)$ contributes to directional error in the filter update vector.
Errors in the data vector $X$, moreover, may change the sign of the adaptive
filter output $y(t) = X^T A^{old}$ or cause it to round to zero: in the first case,
the filter vector moves in a direction which tends to increase the average
squared filter output $\overline{y^2(t)}$; in the second case, the adaptive filter vector
cannot move. After multiplication of the vector $(\bar{X} - X)$ by the scalar quantity

\[ \frac{2 K_s \, X^T A^{old}}{(\bar{X} - X)^T (\bar{X} - X)} \]

the individual components of the resultant scaled vector are rounded to the
nearest integer in the numerical representation of the filter weights and then
added to the corresponding components of the old filter vector. The
filter-weight roundoff process just described introduces further directional
error into the filter update vector. The filter-weight roundoff error can be made
as small as desired by using a sufficient number of bits in the numerical
representation of the filter weights. A similar capability, however, is not
available in computing the adaptive filter output $y(t)$, where the preservation
of a non-zero value with the correct sign is ultimately limited by the
quantization error introduced by the digitization system.
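
The update and rounding steps described above can be sketched as follows. This
is only an illustration under stated assumptions: the function names, the flat
list layout of the vectors, and the use of the exact denominator (rather than
the moving power average actually employed) are hypothetical choices, not the
report's program.

```python
import math


def round_half_up(v):
    """Round to the nearest integer (ties go toward +infinity)."""
    return math.floor(v + 0.5)


def update_weights(a_old, x, d, k_s):
    """One step of A_new = A_old + 2*K_s*y*d / (d.d), rounded per component.

    a_old : integer filter weights (hypothetical flat layout)
    x     : data vector X
    d     : difference vector (Xbar - X)
    k_s   : convergence factor
    """
    y = sum(xi * ai for xi, ai in zip(x, a_old))  # filter output X^T A_old
    denom = sum(di * di for di in d)              # exact (Xbar-X)^T (Xbar-X)
    if denom == 0:
        return list(a_old)
    scale = 2.0 * k_s * y / denom
    # Each component of the scaled vector is rounded before it is added.
    return [ai + round_half_up(scale * di) for ai, di in zip(a_old, d)]
```

With an output that rounds to zero, or with a scaled update whose components
are all smaller than one-half, the returned weights equal the old weights,
which is the immobilization discussed in the text.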

The  principal  questions  to  be  answered  in  determining  the  error  in 
the  direction  of  movement  of  the  filter  weight  vector  are,  therefore, 

•  How much directional accuracy is maintained in the vector
   $(\bar{X} - X)$ after machine computational procedures have been
   performed?

•  How often do computational and digitization errors reverse the
   sign of the adaptive filter output or result in a zero value for
   the adaptive filter output when it is rounded to the nearest
   integer?

•  What  precision  is  required  in  the  numerical  representation  of 
the  filter  weights  to  achieve  acceptable  error  levels  when  the 
filter  update  vector  is  added  to  the  old  filter  weight  vector? 

Subsections B and C examine error in the vector $(\bar{X} - X)$. Subsection
B, a necessary prelude to subsection C, discusses errors in a single
component of the vector $(\bar{X} - X)$. Error is traced through the individual
processes leading to the final value $\bar{x}(t-j) - x_i(t-j)$. Probability
densities, standard deviations of error, and maximum errors are presented for
rotated data, prefiltered data, the beamsteer output, and the beamsteer output
minus a single channel.

After the preliminaries of subsection B, directional error in the vector
$(\bar{X} - X)$ is investigated in subsection C. The error $E$ in $(\bar{X} - X)$ is
separated into a part $E_c$ satisfying the filter-weight constraint conditions
and a part $E_b$ perpendicular to the vector space corresponding to the
filter-weight constraints. Probability distributions for the angle of error in
$(\bar{X} - X)$ and the angle of error in its projection onto the constraint space
are calculated.

Subsection D considers the effect of digitization and roundoff error on
the adaptive filter output $y(t)$. The probability that $y(t)$ rounds to zero and
the probability that $y(t)$ changes sign are estimated. Since these
probabilities depend on the filter vector $A$, a derivation of the
maximum-likelihood filter-design equations is included so that the filter vector
may be determined from the data crosscorrelation matrix.

Subsection E studies the consequences of rounding off the updated filter
vector $A^{new}$. To evaluate the likelihood that the roundoff process
immobilizes the filter vector by preventing any change in the individual
components, an expression for the squared magnitude $|A^{new} - A^{old}|^2$ is
employed. To estimate the mean angle of error in the vector $A^{new} - A^{old}$,
the probability distributions associated with rounding off the filter weights
are determined. Finally, two different techniques for eliminating discrepancies
in the maximum-likelihood constraints are appraised in terms of the resultant
directional error in the filter update vector.

Subsection  F  is  a  summary  of  the  results  in  this  section. 

B.  ERRORS ASSOCIATED WITH A SINGLE COMPONENT OF THE
    VECTOR $(\bar{X} - X)$

As a first step in estimating the directional error of the vector $(\bar{X} - X)$,
the errors associated with a single component must be established. In order
to evaluate the directional error, it is necessary to distinguish between error
in one component of $(\bar{X} - X)$ due to inaccuracies in the quantities $x_i(t-j)$
prior to forming the beamsteer output and error arising solely from roundoff
error during computation of the beamsteer output $\bar{x}(t-j)$ for time
$t - j\Delta t$. This distinction is necessary because error prior to generating the
beamsteer output cannot dislodge the updated filter vector $A^{new}$ from the
subspace corresponding to the maximum-likelihood constraints. Conversely,
roundoff error in calculating the beamsteer output forces motion perpendicular
to the constraint space. A detailed description of this phenomenon will be given
at a more appropriate point in this section.

In analyzing errors at the single-component level, the digitization
process is presumed to introduce an error with a uniform probability density
between plus and minus one-half count. There is one realistic situation where
this premise is not justified. The ALPA data values are represented as 16-bit
gain-ranged numbers with a 12-bit, two's-complement fraction and a four-bit
negative exponent. When the DC level on any triax component rises to 2048
computer counts or more, the digitization error is effectively multiplied by
the appropriate power of two. For the data samples processed in this report,
one or two of the six channels input to the adaptive processor might have
floating means of this size. To achieve simplicity of presentation, this fact
is ignored. In neglecting this possibility, the digitization error is
underestimated. This underestimation should not materially affect the results
obtained in this section.

A different assumption is made for computational roundoff error. When
an intermediate result must be rounded to the nearest count, the resultant
roundoff error is assumed to lie between -1/2 and +1/2 at equally-spaced
discrete points centered about zero. If none of these discrete points is located
one-half count from zero, they are assumed to be equally likely. If an error of
one-half count is possible, each of the end points is assumed to be half as
likely as each of the interior points. Up to and including the point where the
adaptive channel data points $x_i(t-j)$ are averaged to form the beamsteer
output, the error of each quantity involved in a summation is assumed to be
independent of the error in the other quantities involved in the summation. In
the case of independent summand errors, the probability density function for
the error in the sum is the convolution of the probability density function for
the error due to roundoff with that of each individual summand error.

When the possibility that roundoff error could be affecting adaptive
filter performance was first considered, an experiment was conducted. First,
with the convergence factor $K_s$ set to 0.30, a noise sample from a relatively
quiet summer day was run through the adaptive filter program. Noise in
the adaptive filter output was reduced by 2 dB relative to the beamsteer
output. Second, the data points were multiplied by 16 before processing. At
the same convergence rate, noise reduction was approximately 6 dB. It was
apparent, therefore, that roundoff error was significantly affecting processor
performance. Scaling the data by a factor of 16 was sufficient to achieve noise
reduction comparable to that obtained for the same noise sample by a separate
computer program using floating-point arithmetic.

In this section, error will be examined for the case of unscaled data,
data scaled by a factor of 16, and data where floating-point arithmetic is
used in the adaptive processor. When floating-point arithmetic is employed,
roundoff error is assumed to be negligible, and error control is ultimately
limited by digitization accuracy. A comparison of the error in unscaled data
and data scaled by a factor of 16 shows the error reduction achieved by scaling
the data. A comparison of the error in the scaled data and the data processed
using floating-point arithmetic indicates how closely the scaled data approaches
ultimate achievable precision.

In the first stage of processing, three-component triax data at each site
are combined to form a vertical component by multiplying each triax component
by one third and summing all three components (rounding to the nearest count
when the vertical component is output). In the case of unscaled data, the error
in each summand has a uniform probability density between plus and minus one
sixth (from digitization error), and the roundoff error has equal probability at
the three discrete points -1/3, 0, and 1/3. Since the probability density
function is the convolution of four separate probability density functions, the
resultant probability density function has a Fourier transform which is the
product of the individual transforms for each of the four separate independent
probability density functions. For a uniform density between $-c/2$ and $c/2$,
the Fourier transform is


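\[ \int_{-c/2}^{c/2} \frac{1}{c} \, e^{-i 2\pi s y} \, dy
   = \frac{e^{i\pi c s} - e^{-i\pi c s}}{i 2\pi c s}
   = \frac{\sin \pi c s}{\pi c s} \quad (\text{or 1 if } s = 0)
   = \mathrm{sinc}\, cs, \]

where

\[ \mathrm{sinc}\, s = \frac{\sin \pi s}{\pi s} \quad (\text{or 1 if } s = 0). \]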


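For a finite-comb probability density function

\[ p(y) = \frac{1}{n} \sum_{j=-(n-1)/2}^{(n-1)/2} \delta\left(y - \frac{j}{n}\right), \]

where $n$ is an odd integer and $\delta$ is the Dirac delta function, the Fourier
transform is

\[ P(s) = \int_{-\infty}^{\infty} p(y) \, e^{-i 2\pi s y} \, dy
        = \frac{1}{n} \sum_{j=-(n-1)/2}^{(n-1)/2} e^{-i 2\pi j s / n}. \]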

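The transform is a geometric progression, so that

\[ \left( e^{i\pi s/n} - e^{-i\pi s/n} \right) P(s)
   = \frac{1}{n} \sum_{j=-(n-1)/2}^{(n-1)/2}
     \left[ e^{-i 2\pi (j - 1/2) s/n} - e^{-i 2\pi (j + 1/2) s/n} \right]
   = \frac{1}{n} \left( e^{i\pi s} - e^{-i\pi s} \right) \]

and

\[ P(s) = \frac{1}{n} \,
          \frac{\left( e^{i\pi s} - e^{-i\pi s} \right) / 2i}
               {\left( e^{i\pi s/n} - e^{-i\pi s/n} \right) / 2i}
        = \frac{\sin \pi s}{n \sin(\pi s / n)} \]

(or 1 if $s$ is an integer multiple of $n$).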
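
As a numerical check of the closed form for odd n, the sketch below compares
the direct sum over the comb points with sin(pi s) / (n sin(pi s / n)). The
function names and the tolerance used to detect integer multiples of n are
assumptions made for this illustration.

```python
import cmath
import math


def comb_transform_direct(n, s):
    """Fourier transform of n equally likely points k/n, k = -(n-1)/2..(n-1)/2."""
    half = (n - 1) // 2
    total = sum(cmath.exp(-2j * math.pi * k * s / n)
                for k in range(-half, half + 1))
    return (total / n).real  # imaginary parts cancel by symmetry


def comb_transform_closed(n, s):
    """Closed form sin(pi s) / (n sin(pi s / n)), or 1 at multiples of n."""
    denom = n * math.sin(math.pi * s / n)
    if abs(denom) < 1e-12:
        return 1.0
    return math.sin(math.pi * s) / denom
```

For n = 3 this is the transform of the roundoff error at -1/3, 0, and 1/3
count that arises when the vertical component is formed.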


If the function $\mathcal{K}_n(s)$ is defined for odd integers $n$ as the Fourier
transform just computed for the finite-comb probability density function, the
Fourier transform of the error in the vertical component for unscaled data is
$\mathrm{sinc}^3(s/3)\,\mathcal{K}_3(s)$. The maximum possible error is 5/6 count. In the
case of data scaled by a factor of 16, the digitization error is multiplied by
16 during scaling, but roundoff error is still the same. The Fourier transform
of the ensuing error probability density function is
$\mathrm{sinc}^3(16s/3)\,\mathcal{K}_3(s)$. The maximum possible error is 8 1/3 counts. (At the
same time that the absolute error is higher because of scaling, the data points
are 16 times larger, so that the relative error is reduced.) For data going
through a floating-point processor, there is negligible roundoff error. The
Fourier transform of the corresponding probability density function is
$\mathrm{sinc}^3(s/3)$. The maximum possible error is 1/2 count. Inverse Fourier
transforms graphing the error probability density per count for unscaled data,
data scaled by 16, and data passing through a floating-point processor are
presented in Figure III-1. The physical size of the vertical scale was
increased by 16 and that of the horizontal scale reduced by 16 on the page in
the case of data scaled by 16 to permit meaningful comparison between the three
modes of processing for the relative error in the vertical-component output.
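
The maximum errors quoted for the vertical component follow from simple
arithmetic, which the sketch below reproduces with exact fractions. The
function name and argument layout are hypothetical. The variance it returns is
computed from the stated error model (uniform digitization error plus
three-point roundoff error) and is not a value taken from the report's tables.

```python
from fractions import Fraction


def vertical_component_error(scale, integer_arithmetic):
    """Return (max_error, variance) in counts for the vertical-component error.

    scale              : data scale factor applied before processing (1 or 16)
    integer_arithmetic : True if the output is rounded to the nearest count
    """
    half_width = Fraction(scale, 6)      # each summand is uniform on +/- scale/6
    max_err = 3 * half_width
    variance = 3 * (2 * half_width) ** 2 / 12   # uniform variance = width^2 / 12
    if integer_arithmetic:
        max_err += Fraction(1, 3)        # roundoff error lies at -1/3, 0, +1/3
        variance += Fraction(2, 3) * Fraction(1, 9)
    return max_err, variance
```

Calling the function with (1, True), (16, True), and (1, False) gives maximum
errors of 5/6, 8 1/3, and 1/2 count, matching the three cases in the text.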

FIGURE III-1

ERROR PROBABILITY DENSITY PER COUNT AFTER
FORMATION OF VERTICAL COMPONENT
(a) Unscaled Data

In the second stage of processing, the vertical-component data traces
are funneled through a filter with exactly zero response at DC. The filter
weights (scaled by $2^{15}$) for this filter were given previously on page II-9.
The equation which describes the filter implementation is

\[ x_i(t) = \sum_{j=-15}^{15} b_j \, v_i(t-j), \]

where $x_i(t)$ is the prefiltered vertical-component trace for the i-th site at
time $t$, $b_j$ is the filter weight for the j-th lag, and $v_i(t-j)$ is the
vertical-component input trace for the i-th site at time $t - j\Delta t$. Since the
filter weights are all multiples of $2^{-15}$, the roundoff error is presumed to
lie at one of the 32769 equally-spaced points between plus and minus one-half
count. Both of the end points are presumed to be half as likely as any of the
interior points, so that the probability density function for the roundoff
error is

\[ p(y) = \frac{1}{2n} \left[ \delta\left(y + \frac{1}{2}\right) + \delta\left(y - \frac{1}{2}\right) \right]
        + \frac{1}{n} \sum_{j=-(n/2 - 1)}^{n/2 - 1} \delta\left(y - \frac{j}{n}\right), \]

where $n$ is 32768. The corresponding Fourier transform is


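\[ P(s) = \int_{-\infty}^{\infty} p(y) \, e^{-i 2\pi s y} \, dy
        = \frac{1}{2n} \sum_{j=-n/2}^{n/2 - 1} e^{-i 2\pi j s / n}
        + \frac{1}{2n} \sum_{j=-(n/2 - 1)}^{n/2} e^{-i 2\pi j s / n}, \]

which is the sum of two geometric progressions. Hence

\[ \left( e^{i\pi s/n} - e^{-i\pi s/n} \right) P(s)
   = \frac{1}{2n} \left[ e^{i\pi (n+1) s/n} + e^{i\pi (n-1) s/n}
                       - e^{-i\pi (n-1) s/n} - e^{-i\pi (n+1) s/n} \right]
   = \frac{1}{2n} \left( e^{i\pi s} - e^{-i\pi s} \right)
                  \left( e^{i\pi s/n} + e^{-i\pi s/n} \right), \]

so that

\[ P(s) = \frac{1}{n} \,
          \frac{\left[ \left( e^{i\pi s} - e^{-i\pi s} \right) / 2i \right]
                \left[ \left( e^{i\pi s/n} + e^{-i\pi s/n} \right) / 2 \right]}
               {\left( e^{i\pi s/n} - e^{-i\pi s/n} \right) / 2i}
        = \frac{\sin \pi s}{n \sin(\pi s / n)} \cos\left( \frac{\pi s}{n} \right), \]

or 1 when $s$ is an integer multiple of $n$. The function $\mathcal{K}_n(s)$ for even
integers $n$ is defined as the expression just given. Under the assumption that
the errors in the separate vertical-component input points are mutually
independent and, further, independent of the roundoff error at output, the
Fourier transform of the resulting error probability density function for
unscaled data is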
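
The even-n case, in which the two end points at plus and minus one-half count
carry half the probability of the interior points, can be checked numerically
in the same spirit. The sketch below compares the direct weighted sum with the
closed form sin(pi s) cos(pi s / n) / (n sin(pi s / n)). The function names and
the zero-detection tolerance are assumptions made for this illustration.

```python
import cmath
import math


def even_comb_direct(n, s):
    """Transform of n+1 points k/n, k = -n/2..n/2, with half-weight end points."""
    half = n // 2
    total = 0j
    for k in range(-half, half + 1):
        weight = 0.5 if abs(k) == half else 1.0  # end points are half as likely
        total += weight * cmath.exp(-2j * math.pi * k * s / n)
    return (total / n).real


def even_comb_closed(n, s):
    """Closed form for even n, or 1 when s is an integer multiple of n."""
    denom = n * math.sin(math.pi * s / n)
    if abs(denom) < 1e-12:
        return 1.0
    return math.sin(math.pi * s) * math.cos(math.pi * s / n) / denom
```

The same routine applies to n = 32768 for the prefilter roundoff error, although
small n is enough to confirm the algebra.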


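\[ \mathcal{K}_{32768}(s) \prod_{j=-15}^{15}
   \mathrm{sinc}^3\left( \frac{b_j s}{3} \right) \mathcal{K}_3(b_j s). \]

Since the prefilter is symmetric about zero, $b_{-j} = b_j$ and the Fourier
transform above may be written

\[ \mathcal{K}_{32768}(s) \, \mathrm{sinc}^3\left( \frac{b_0 s}{3} \right) \mathcal{K}_3(b_0 s)
   \prod_{j=1}^{15} \left[ \mathrm{sinc}^3\left( \frac{b_j s}{3} \right) \mathcal{K}_3(b_j s) \right]^2. \]

The maximum possible error for unscaled data is

\[ \frac{5}{6} \sum_{j=-15}^{15} |b_j| + \frac{1}{2} \]

counts, or slightly less than 2 1/6 counts, since the sum of the filter weight
absolute values is slightly less than 2. For data scaled by 16, the Fourier
transform of the error probability density function is

\[ \mathcal{K}_{32768}(s) \, \mathrm{sinc}^3\left( \frac{16 b_0 s}{3} \right) \mathcal{K}_3(b_0 s)
   \prod_{j=1}^{15} \left[ \mathrm{sinc}^3\left( \frac{16 b_j s}{3} \right) \mathcal{K}_3(b_j s) \right]^2 \]

and the maximum possible error is

\[ \frac{25}{3} \sum_{j=-15}^{15} |b_j| + \frac{1}{2} \]

counts, or slightly less than 17 1/6 counts. For data processed with
floating-point arithmetic, the corresponding Fourier transform is

\[ \prod_{j=-15}^{15} \mathrm{sinc}^3\left( \frac{b_j s}{3} \right) \]

and the maximum possible error is

\[ \frac{1}{2} \sum_{j=-15}^{15} |b_j| \]

counts, or slightly less than 1 count. Inverse Fourier transforms giving the
error probability density per count after prefiltering are shown in Figure III-2
for unscaled data, data scaled by 16, and data processed with floating-point
arithmetic.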

In the third stage of processing, the prefiltered vertical-component
data traces for all sites at time $t$ are summed and divided by the number of
channels to create the beamsteer output. When division is performed, the
result is rounded to the nearest count. Multiplication by the reciprocal of the
number of channels has the effect of dividing the argument of the Fourier
transforms in the previous stage by the number of channels. On the other hand,
the assumption of mutually independent error between the individual prefiltered
vertical components at each site means that the Fourier transforms
corresponding to the sum of the traces (scaled by 1/M) must be raised to a power
equal to the number of channels. Prior to rounding off to the nearest integer,
therefore, the Fourier transforms of the beamsteer output error probability
density functions are

\[ \left[ \mathcal{K}_{32768}\left( \frac{s}{6} \right) \prod_{j=-15}^{15}
   \mathrm{sinc}^3\left( \frac{b_j s}{18} \right) \mathcal{K}_3\left( \frac{b_j s}{6} \right) \right]^6, \]

\[ \left[ \mathcal{K}_{32768}\left( \frac{s}{6} \right) \prod_{j=-15}^{15}
   \mathrm{sinc}^3\left( \frac{16 b_j s}{18} \right) \mathcal{K}_3\left( \frac{b_j s}{6} \right) \right]^6, \]

and

\[ \left[ \prod_{j=-15}^{15} \mathrm{sinc}^3\left( \frac{b_j s}{18} \right) \right]^6 \]

for unscaled data, data scaled by 16, and floating-point data, respectively,
when six channels are used in the adaptive processor. The beamsteer error
probability density functions prior to roundoff are shown in Figure III-3 for
the three cases considered. Before rounding to the nearest integer, the
maximum possible errors are exactly the same as in the previous stage: 2 1/6
counts, 17 1/6 counts, and 1 count, respectively.

In those instances where integer arithmetic is employed, roundoff
error may occur at the seven discrete points ranging from minus one-half to
plus one-half count at increments of 1/6 count. The two endpoints are
assumed to be half as likely as the interior points, so that the Fourier
transform for the roundoff error is $\mathcal{K}_6(s)$ both for unscaled data and data
scaled by a factor of 16. The beamsteer roundoff error probability density for
both cases is pictured in Figure III-4. According to the assumptions stated
earlier, floating-point arithmetic produces negligible roundoff error in the
beamsteer output.

To determine the beamsteer error probability density functions after
roundoff for unscaled data and data scaled by 16, the Fourier transforms prior
to roundoff are multiplied by $\mathcal{K}_6(s)$, and inverse Fourier transforms are
taken. The probability density function for the floating-point beamsteer error
is the same as before. Figure III-5 gives the three specified probability
density functions. After roundoff, the maximum possible errors are 2 2/3 counts,
17 2/3 counts, and 1 count, respectively. One-half count is added to the
maximum beamsteer output error when the output is rounded to the nearest
integer. No roundoff procedure is performed when floating-point arithmetic is
used.


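In the fourth stage of processing, each individual component of the
vector $(\bar{X} - X)$ is formed by subtracting the prefiltered vertical component
for the i-th site from the beamsteer output at the same time. No further
roundoff error is produced: either integers are subtracted from integers or
floating-point arithmetic generates no significant error. The error in an
individual component of $(\bar{X} - X)$ is

\[ \left[ \frac{1}{M} \sum_{k=1}^{M} \left( x_k(t-j) + \epsilon_k(t-j) \right)
          + \epsilon_b(t-j) - x_i(t-j) - \epsilon_i(t-j) \right]
   - \left[ \frac{1}{M} \sum_{k=1}^{M} x_k(t-j) - x_i(t-j) \right] \]

\[ = \epsilon_b(t-j) + \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) - \epsilon_i(t-j)
   = \epsilon_b(t-j) + \frac{1}{M} \sum_{k=1, k \ne i}^{M} \epsilon_k(t-j)
     - \frac{M-1}{M} \, \epsilon_i(t-j), \]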


where $\epsilon_b(t-j)$ is the beamsteer output roundoff error at time $t - j\Delta t$ (due
solely to the roundoff operation carried out at the tail end of the beamsteer
output computation), where $\epsilon_i(t-j)$ is the error in the prefiltered vertical
component for the i-th site at time $t - j\Delta t$ (the same component for which
the error in $\bar{X} - X$ is being evaluated), where $\epsilon_k(t-j)$ is the error in the
prefiltered vertical component of the k-th site at time $t - j\Delta t$, and where $M$
is the number of channels. If the term $\epsilon_b(t-j)$ is ignored and $P(s)$ is the
Fourier transform of the probability density function for the error in any
prefiltered vertical component, then (with six channels)

\[ \left[ P\left( \frac{s}{6} \right) \right]^5 P\left( \frac{5s}{6} \right) \]

is the Fourier transform of the probability density function for the error in
each component of the vector $(\bar{X} - X)$. (As in the case where the beamsteer
output error was discussed, the random variables $\epsilon_k(t-j)$ are considered
to be mutually independent.) Figure III-6 shows the corresponding probability
density functions for unscaled data, data scaled by 16, and floating-point data,
respectively. In each case, the maximum possible error (ignoring the beamsteer
output roundoff error) is 5/3 as large as the maximum possible prefiltered
vertical-component error. As a result, the corresponding maximum
errors are 3 11/18, 28 11/18, and 1 2/3 counts. When the term $\epsilon_b(t-j)$ is
not ignored, the Fourier transform of the probability density function for the
error in each component of the vector $(\bar{X} - X)$ is

\[ \mathcal{K}_6(s) \left[ P\left( \frac{s}{6} \right) \right]^5 P\left( \frac{5s}{6} \right) \]

for the two cases where integer arithmetic is employed. The error for
floating-point data remains the same as before. The probability densities for
unscaled data, data scaled by 16, and floating-point data are graphed in
Figure III-7. Maximum possible errors are increased by one-half count during
integer roundoff, and the three maximum errors are 4 1/9, 29 1/9, and 1 2/3
counts.

As the various probability densities were computed, standard deviations
for the error were calculated. Table III-1 is a summary of the standard
deviation and maximum possible error at each stage for the three modes of
processing examined.
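
The maximum-error figures quoted for the second through fourth stages can be
reproduced with exact fractions, as in the sketch below. It is an illustration
only: the function name is hypothetical, and the sum of the prefilter weight
magnitudes is replaced by its stated upper bound of 2, since the weights
themselves are not listed in this section.

```python
from fractions import Fraction

WEIGHT_SUM_BOUND = Fraction(2)  # assumed bound: sum of |b_j| is slightly below 2
M = 6                           # number of channels


def max_error_chain(vertical_max, integer_arithmetic):
    """Bounds on the maximum error after prefiltering, after beamsteering,
    and in one component of (Xbar - X) before and after beamsteer roundoff."""
    rounding = Fraction(1, 2) if integer_arithmetic else Fraction(0)
    prefiltered = vertical_max * WEIGHT_SUM_BOUND + rounding
    beamsteer = prefiltered + rounding  # averaging keeps the bound unchanged
    # (1/M) of each of the other M-1 channels plus (M-1)/M of channel i
    difference_before = prefiltered * Fraction(2 * (M - 1), M)
    difference_after = difference_before + rounding
    return prefiltered, beamsteer, difference_before, difference_after
```

Starting from vertical-component maxima of 5/6, 25/3, and 1/2 count, the
function returns the bounds 2 1/6, 2 2/3, 3 11/18, 4 1/9 for unscaled data,
17 1/6, 17 2/3, 28 11/18, 29 1/9 for data scaled by 16, and 1, 1, 1 2/3, 1 2/3
for floating-point data.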


FIGURE III-6

ERROR PROBABILITY DENSITY PER COUNT IN THE TERM
$\bar{x}(t) - x_i(t)$ (BEAMSTEER OUTPUT MINUS A SINGLE
CHANNEL) BEFORE BEAMSTEER ROUNDOFF
(a) Unscaled Data; (b) Data Scaled by 16; (c) Floating-Point Data


FIGURE III-7

(a) Unscaled Data; (b) Data Scaled by 16; (c) Floating-Point Data


C.  DIRECTIONAL ERROR OF THE VECTOR $(\bar{X} - X)$

1.  Synopsis of this Subsection

In the next part of this subsection, the geometrical relationships
between error in the vector $(\bar{X} - X)$ and the maximum-likelihood constraints on
the filter update vector $(A^{new} - A^{old})$ are explored. By separating the error
vector into the part $E_c$ due to error before beamsteer roundoff and the part
$E_b$ due to the roundoff process occurring at the very end of the beamsteer
output formation, it is possible to show that the vector $E_c$ lies within the
constraint space and the vector $E_b$ is normal to the constraint space.

In part three, probability distributions for the squared magnitudes
$|E_c|^2$, $|E_b|^2$, and $|\bar{X} - X|^2$ are specified.

In part four, the probability distribution is given for the angle $\alpha$
between the vectors $E_c$ and $(\bar{X} - X)$ within the constraint space.

In part five, the probability distribution for the angle $\theta$ between the
vector $(\bar{X} - X) + E_c$ and $(\bar{X} - X)$ is derived and evaluated. The angle $\theta$ is
the angle between the vector $(\bar{X} - X)$ and the projection of the vector
$(\bar{X} - X) + E_c + E_b$ onto the constraint space.

In the final part of this subsection, a mathematical expression for the
probability of the angle between the vector $(\bar{X} - X) + E_c + E_b$ and the vector
$(\bar{X} - X)$ is presented. This angle is the angle of error in the vector
$(\bar{X} - X)$.

2.  Geometrical Relationships Between the Error in $(\bar{X} - X)$ and
    the Maximum-Likelihood Constraint Conditions

The maximum-likelihood filter set is designed to pass, with unity
response at all frequencies, a signal from the desired look direction. Provided
that the data channels have been time-shifted according to the beamsteer time
delays, this requirement results in the 2N+1 equations

\[ \sum_{i=1}^{M} a_i(j) = \delta_{j0} \qquad (j = -N, \ldots, -1, 0, 1, \ldots, N), \]

where $\delta_{j0}$ is the Kronecker delta operator. If the filter change vector
$(A^{new} - A^{old})$ satisfies the relationships

\[ \sum_{i=1}^{M} \left[ a_i^{new}(j) - a_i^{old}(j) \right] = 0 \]

and the old filter vector satisfies the maximum-likelihood constraints, the new
filter vector also satisfies the constraints:

\[ \sum_{i=1}^{M} a_i^{new}(j) = \sum_{i=1}^{M} a_i^{old}(j)
   + \sum_{i=1}^{M} \left[ a_i^{new}(j) - a_i^{old}(j) \right] = \delta_{j0}. \]

Henceforth the constraint space will refer to the set of vectors satisfying the
restrictions

\[ \sum_{i=1}^{M} \left[ a_i^{new}(j) - a_i^{old}(j) \right] = 0 \]

imposed on the filter change vector for all lags $j$. These 2N+1 equations each
define a plane of dimension M-1 in the M-dimensional subspace corresponding
to the j-th lag. If unit vectors $U_j$ (each within the subspace associated with
the j-th lag) are defined by the equation


In the previous subsection, it was demonstrated that the error in a
single component of $(\bar{X} - X)$ is

\[ \epsilon_b(t-j) + \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) - \epsilon_i(t-j), \]

where $\epsilon_b(t-j)$ is the beamsteer roundoff error at time $t - j\Delta t$ and where
$\epsilon_k(t-j)$ and $\epsilon_i(t-j)$ are the errors in the prefiltered vertical component at
time $t - j\Delta t$ for the k-th and i-th channels, respectively. The error vector
may be broken down into the portion $E_b$ due to beamsteer roundoff error and
the portion $E_c$ due to error in the prefiltered vertical component outputs.
Since the beamsteer roundoff error is identical for all channels at time
$t - j\Delta t$, the vector $E_b$ is

\[ E_b = \sum_{j=-N}^{N} \sqrt{M} \, \epsilon_b(t-j) \, U_j, \]

a vector normal to the constraint space. The space of all such possible vectors
$E_b$ is spanned by the 2N+1 orthonormal vectors $U_j$. It is therefore
of dimension 2N+1.


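The vector $E_c$, on the other hand, satisfies the constraint equations

\[ U_j^T E_c = \frac{1}{\sqrt{M}} \sum_{i=1}^{M}
   \left[ \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) - \epsilon_i(t-j) \right] = 0, \]

and so it must lie entirely within the constraint space. In the subspace
coinciding with time $t - j\Delta t$, the vectors

\[ U_j = \frac{1}{\sqrt{M}} \, (1, 1, 1, \ldots, 1, 1)^T, \]

\[ W_{1j} = \frac{1}{\sqrt{M(M-1)}} \, (M-1, -1, -1, \ldots, -1, -1)^T, \]

\[ W_{2j} = \frac{1}{\sqrt{(M-1)(M-2)}} \, (0, M-2, -1, \ldots, -1, -1)^T, \]

\[ W_{3j} = \frac{1}{\sqrt{(M-2)(M-3)}} \, (0, 0, M-3, -1, \ldots, -1)^T, \]

\[ \vdots \]

\[ W_{M-2,j} = \frac{1}{\sqrt{3 \cdot 2}} \, (0, 0, \ldots, 0, 2, -1, -1)^T, \]

and

\[ W_{M-1,j} = \frac{1}{\sqrt{2 \cdot 1}} \, (0, 0, \ldots, 0, 0, 1, -1)^T \]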

form an orthonormal basis. The inner product between $E_c$ and $W_{ij}$ is

\[ E_c^T W_{ij} = \frac{ (M-i) \left[ \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) - \epsilon_i(t-j) \right]
   - \sum_{p=i+1}^{M} \left[ \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) - \epsilon_p(t-j) \right] }
   { \sqrt{(M+1-i)(M-i)} }
   = \frac{ \sum_{p=i+1}^{M} \epsilon_p(t-j) - (M-i) \, \epsilon_i(t-j) }
          { \sqrt{(M+1-i)(M-i)} }. \]

This inner product is not identically zero, so that the vectors $E_c$ fill the
space spanned by the orthonormal vectors $W_{ij}$ (i = 1, 2, ..., M-1;
j = -N, ..., -1, 0, 1, ..., N). The space of all possible vectors $E_c$,
therefore, is of dimension (M-1)(2N+1).
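
The basis just described can be checked numerically for a single lag. The
sketch below builds the unit vector with equal components together with the
M-1 vectors that span the constraint space, and forms the per-lag part of the
error vector that excludes beamsteer roundoff. The function names and the test
values are hypothetical choices for this illustration.

```python
import math


def constraint_basis(m):
    """Return [U, W_1, ..., W_{m-1}] for one lag, each as a list of m floats."""
    basis = [[1.0 / math.sqrt(m)] * m]
    for i in range(1, m):
        norm = math.sqrt((m + 1 - i) * (m - i))
        # i-1 zeros, then (m - i), then m-i entries equal to -1
        w = [0.0] * (i - 1) + [float(m - i)] + [-1.0] * (m - i)
        basis.append([c / norm for c in w])
    return basis


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def remove_mean_error(eps):
    """Per-lag error excluding beamsteer roundoff: mean(eps) - eps_i."""
    mean = sum(eps) / len(eps)
    return [mean - e for e in eps]
```

For any choice of channel errors, the result of remove_mean_error has zero
inner product with the first basis vector, while its inner products with the
remaining vectors are generally non-zero, in line with the dimension count
(M-1)(2N+1).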


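3.  Probability Distributions for the Squared Magnitudes
    $|E_c|^2$, $|E_b|^2$, and $|\bar{X} - X|^2$

Let the vinculum operator $\overline{Q}$ for any function $Q$ of the M(2N+1)
variables $\epsilon_i(t-j)$ denote the mean of the function $Q$ over all possible values
of $\epsilon_i(t-j)$:

\[ \overline{Q} = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} Q
   \prod_{i=1}^{M} \prod_{j=-N}^{N} p\left[ \epsilon_i(t-j) \right] d\epsilon_i(t-j). \]

The mean of $|E_c|^2$ is then

\[ \mu_c = \overline{|E_c|^2}
   = \sum_{j=-N}^{N} \sum_{i=1}^{M}
     \overline{ \left[ \epsilon_i(t-j) - \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) \right]^2 }
   = (M-1)(2N+1) \, \overline{\epsilon_i^2(t-j)}, \]

provided that the zero-mean random variables $\epsilon_i(t-j)$ are mutually
independent and identically distributed.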


Similarly, the variance of $|E_c|^2$ is

\[ \sigma_c^2 = \overline{|E_c|^4} - \left( \overline{|E_c|^2} \right)^2
   = \frac{(M-1)(2N+1)}{M} \left\{ (M-1) \left[ \overline{\epsilon_i^4(t-j)}
     - \left( \overline{\epsilon_i^2(t-j)} \right)^2 \right]
     + 2 \left( \overline{\epsilon_i^2(t-j)} \right)^2 \right\} \]

under the same assumptions as before. The quantity inside the braces is (M-1)
times the variance of $\epsilon_i^2(t-j)$ plus twice the squared mean of $\epsilon_i^2(t-j)$.
If the unsquared random variable $\epsilon_i(t-j)$ were normally distributed, the
following equations would be valid:

\[ \overline{\epsilon_i^4(t-j)} - \left( \overline{\epsilon_i^2(t-j)} \right)^2
   = 2 \left( \overline{\epsilon_i^2(t-j)} \right)^2, \]

\[ \sigma_c^2 = (M-1)(2N+1) \left[ \overline{\epsilon_i^4(t-j)}
   - \left( \overline{\epsilon_i^2(t-j)} \right)^2 \right], \]

\[ \sigma_c^2 = 2 (M-1)(2N+1) \left( \overline{\epsilon_i^2(t-j)} \right)^2. \]

In that event, these equations could have been deduced by trivial manipulations
of the $\chi^2$-distribution for (M-1)(2N+1) dimensions. Unfortunately, the
prefiltered vertical-component output error $\epsilon_i(t-j)$ is not normally
distributed, as can be seen from Table III-2, which lists the variance of
$\epsilon_i^2(t-j)$ together with the ratio of its variance to its squared mean for
unscaled data, data scaled by 16, and floating-point data. The ratio in the
righthand column is sufficiently different from two that the non-Gaussian
character of $\epsilon_i(t-j)$ must be taken into account.
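
The non-Gaussian variance expression (M-1 times the variance of the squared
error plus twice its squared mean, scaled by (M-1)/M per lag) can be verified
exactly on a small example. The sketch below enumerates every outcome for one
lag and compares the result with the formula. The error distribution used here
(independent, uniform over a short list of zero-mean integer values) and the
function names are assumptions chosen only to keep the enumeration small; they
are not the report's error densities.

```python
from fractions import Fraction
from itertools import product


def per_lag_moments(values, m):
    """Exact mean and variance of sum_i (eps_i - mean_k eps_k)^2 for one lag,
    with each eps_i drawn independently and uniformly from `values`."""
    outcomes = []
    for eps in product(values, repeat=m):
        mean = Fraction(sum(eps), m)
        outcomes.append(sum((e - mean) ** 2 for e in eps))
    n = len(outcomes)
    mu = sum(outcomes, Fraction(0)) / n
    var = sum((o - mu) ** 2 for o in outcomes) / n
    return mu, var


def formula_moments(values, m):
    """Mean and variance from the closed-form expressions (values zero-mean)."""
    m2 = Fraction(sum(v ** 2 for v in values), len(values))
    m4 = Fraction(sum(v ** 4 for v in values), len(values))
    mu = (m - 1) * m2
    var = Fraction(m - 1, m) * ((m - 1) * (m4 - m2 ** 2) + 2 * m2 ** 2)
    return mu, var
```

Because the lags are treated as independent, the totals for all 2N+1 lags
follow by multiplying both moments by 2N+1.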

The random variable $|E_c|^2$, on the other hand, is the sum of the
2N+1 (i.e., 31) independent, identically-distributed random variables

\[ \sum_{i=1}^{M} \left[ \epsilon_i(t-j) - \frac{1}{M} \sum_{k=1}^{M} \epsilon_k(t-j) \right]^2. \]

The Central Limit Theorem will be assumed valid, so that the probability
distribution of $|E_c|^2$ can be approximated by a normal distribution with mean
$\mu_c$ and variance $\sigma_c^2$. Table III-3 gives the mean, standard deviation, and
variance of $|E_c|^2$ for the three cases being considered. Figure III-8 is a
plot of the corresponding probability density functions for all three cases.

The random variable $|E_b|^2$, similarly, is the sum of the 2N+1
independent, identically-distributed random variables $M \epsilon_b^2(t-j)$. In the two
cases involving integer arithmetic, $|E_b|^2$ has the same probability
distribution. With six channels, the probability mass function for the unsquared
random variable $\epsilon_b(t-j)$ is

    p(y) = 1/12     (y = -1/2, 1/2)
    p(y) = 1/6      (y = -1/3, -1/6, 0, 1/6, 1/3),

so that the probability mass function for $M \epsilon_b^2(t-j)$ is

    p(y) = 1/6      (y = 0, 1 1/2)
    p(y) = 1/3      (y = 1/6, 2/3).


The probability mass function for $|E_b|^2$ is obtained by convolving 2N+1 copies of this function. Figure III-9 pictures the resulting probability mass function. Probability is concentrated at discrete points between 0 and 46 1/2 at intervals of 1/6. The vertical lines give the probability for each discrete point. The mean, standard deviation, and variance are 16.361 counts², 2.805 counts², and 7.870 counts⁴, respectively. Note that the plot is slightly skewed: the mode point (16.167 counts²) is slightly below the mean, and the probability envelope falls off somewhat less rapidly on the right. When floating-point arithmetic is used, the beamsteer roundoff error is assumed to be negligible, so that $|E_b|^2 = 0$.
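The convolution just described can be sketched in a few lines. The sketch below is illustrative only: it assumes the four-point mass function given above for one term of the sum, represents values on a lattice of spacing 1/6, and uses exact rational arithmetic. The names `SINGLE`, `convolve`, and `pmf_of_sum` are hypothetical and do not come from the report.

```python
from fractions import Fraction

# PMF of one term M*ebar^2 on a lattice of spacing 1/6: index k means the value k/6.
# The values 0, 1/6, 2/3, 3/2 correspond to indices 0, 1, 4, 9.
SINGLE = {0: Fraction(1, 6), 1: Fraction(1, 3), 4: Fraction(1, 3), 9: Fraction(1, 6)}

def convolve(p, q):
    """PMF of the sum of two independent lattice-valued random variables."""
    out = {}
    for i, pi in p.items():
        for j, qj in q.items():
            out[i + j] = out.get(i + j, 0) + pi * qj
    return out

def pmf_of_sum(terms=31):
    """PMF of the sum of `terms` independent copies of SINGLE (2N+1 = 31 here)."""
    total = {0: Fraction(1)}
    for _ in range(terms):
        total = convolve(total, SINGLE)
    return total

pmf = pmf_of_sum()
mean = sum(Fraction(k, 6) * p for k, p in pmf.items())
variance = sum((Fraction(k, 6) - mean) ** 2 * p for k, p in pmf.items())
print(float(mean), float(variance), float(variance) ** 0.5)
print(max(pmf) / 6)  # largest attainable value of the sum
```

Under these assumptions the mean and variance come out as 589/36 and 10199/1296, which round to the 16.361 and 7.870 quoted above, and the largest attainable value is 46 1/2.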


In the case of the random variable $|X-\bar{X}|^2$, the probability distribution may be measured. A four-hour noise sample from day 238 of 1970 was used for this purpose. The quantity $|X-\bar{X}|^2$ was computed every 31 points using integer data scaled by a factor of 16. The resulting values were divided by 256 and sorted by magnitude to produce the cumulative distribution function and histogram of Figure III-10. In the histogram, a bin width of 1000 counts² was utilized. The vertical axis indicates how many times values of $|X-\bar{X}|^2$ occurred within the limits of a particular bin. The probability distribution of Figure III-10 applies both to the case of unscaled data and floating-point data. To obtain the equivalent distribution for data scaled by 16, the horizontal axis needs to be multiplied by 256.


4.  Probability Distribution for the Angle $\alpha$ Between the Vectors $E_c$ and $(X-\bar{X})$

Within the constraint space, the direction of $(X-\bar{X})$ is presumed to be independent of the direction of $E_c$. This assumption may be satisfied in a number of ways. For example, if the multidimensional probability density function for either $E_c$ or $(X-\bar{X})$ were dependent solely on the magnitude of the vector, this assumption would be satisfied. Such would be the case if the components of either vector were mutually independent and normally distributed. More realistically, if the preferred orientations of one vector were weak and not too strongly related to the preferred orientations of the other vector, the assumption would be a good approximation.

Under the assumption just described, the probability that the angle between $E_c$ and $(X-\bar{X})$ is less than the angle $\alpha$ is equal to the proportion of a sphere subtended by a cone of angle $\alpha$ in a space of n = (M-1)(2N+1) dimensions, where n is the dimension of the constraint space (n = 155 for 6 channels and 31 filter points per channel). The axis of the cone may lie along any direction in the constraint space.

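To find the proportion of the sphere subtended by a cone of angle $\alpha$, the polar coordinate transformation

$$
\begin{aligned}
z_1^n &= \rho\cos\theta_{n-1}\\
z_2^n &= \rho\sin\theta_{n-1}\cos\theta_{n-2}\\
&\;\;\vdots\\
z_{n-1}^n &= \rho\sin\theta_{n-1}\sin\theta_{n-2}\cdots\sin\theta_2\cos\theta_1\\
z_n^n &= \rho\sin\theta_{n-1}\sin\theta_{n-2}\cdots\sin\theta_2\sin\theta_1
\end{aligned}
$$

is made. The superscript n denotes the dimension of the space in which the transformation is made. The Jacobian

$$\frac{\partial(z_1^n,\ldots,z_n^n)}{\partial(\rho,\theta_{n-1},\ldots,\theta_1)},$$

expanded in terms of minors relative to the leftmost column of the determinant, is

$$
\frac{\partial(z_1^n,\ldots,z_n^n)}{\partial(\rho,\theta_{n-1},\ldots,\theta_1)}
=\frac{\partial z_1^n}{\partial\rho}\,
\frac{\partial(z_2^n,z_3^n,\ldots,z_n^n)}{\partial(\theta_{n-1},\ldots,\theta_1)}
-\frac{\partial z_1^n}{\partial\theta_{n-1}}\,
\frac{\partial(z_2^n,z_3^n,\ldots,z_n^n)}{\partial(\rho,\theta_{n-2},\ldots,\theta_1)},
$$

since $\partial z_1^n/\partial\theta_i = 0$ for all values of i from 1 to n-2. The following partial derivatives are easily evaluated:

$$\frac{\partial z_1^n}{\partial\rho}=\cos\theta_{n-1},\qquad
\frac{\partial z_1^n}{\partial\theta_{n-1}}=-\rho\sin\theta_{n-1}.$$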

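The corresponding polar coordinate transformation in the next lower dimension is

$$
\begin{aligned}
z_1^{n-1} &= \rho\cos\theta_{n-2}\\
z_2^{n-1} &= \rho\sin\theta_{n-2}\cos\theta_{n-3}\\
&\;\;\vdots\\
z_{n-2}^{n-1} &= \rho\sin\theta_{n-2}\sin\theta_{n-3}\cdots\sin\theta_2\cos\theta_1\\
z_{n-1}^{n-1} &= \rho\sin\theta_{n-2}\sin\theta_{n-3}\cdots\sin\theta_2\sin\theta_1 .
\end{aligned}
$$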


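Note that, for all values of i from 1 to n-1,

$$z_{i+1}^n=\sin\theta_{n-1}\,z_i^{n-1},$$

so that

$$\frac{\partial z_{i+1}^n}{\partial\rho}=\sin\theta_{n-1}\,\frac{\partial z_i^{n-1}}{\partial\rho},\qquad
\frac{\partial z_{i+1}^n}{\partial\theta_j}=\sin\theta_{n-1}\,\frac{\partial z_i^{n-1}}{\partial\theta_j}$$

for all values of j from 1 to n-2, and

$$\frac{\partial z_{i+1}^n}{\partial\theta_{n-1}}=\cos\theta_{n-1}\,z_i^{n-1}
=\rho\cos\theta_{n-1}\,\frac{\partial z_i^{n-1}}{\partial\rho},$$

since $\rho\,\partial z_i^{n-1}/\partial\rho = z_i^{n-1}$. Therefore,

$$
\begin{aligned}
\frac{\partial(z_1^n,\ldots,z_n^n)}{\partial(\rho,\theta_{n-1},\ldots,\theta_1)}
&=\rho\cos^2\theta_{n-1}\sin^{n-2}\theta_{n-1}\,
\frac{\partial(z_1^{n-1},z_2^{n-1},\ldots,z_{n-1}^{n-1})}{\partial(\rho,\theta_{n-2},\ldots,\theta_1)}\\
&\quad+\rho\sin\theta_{n-1}\sin^{n-1}\theta_{n-1}\,
\frac{\partial(z_1^{n-1},z_2^{n-1},\ldots,z_{n-1}^{n-1})}{\partial(\rho,\theta_{n-2},\ldots,\theta_1)}\\
&=\rho\sin^{n-2}\theta_{n-1}\,
\frac{\partial(z_1^{n-1},z_2^{n-1},\ldots,z_{n-1}^{n-1})}{\partial(\rho,\theta_{n-2},\ldots,\theta_1)} .
\end{aligned}
$$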


The Jacobian in n dimensions can be expressed as the product of $\rho\sin^{n-2}\theta_{n-1}$ and the corresponding Jacobian in the next lower dimension. For the case of two dimensions,

$$z_1^2=\rho\cos\theta_1,\qquad z_2^2=\rho\sin\theta_1$$

and the Jacobian for two dimensions is

$$\begin{vmatrix}\cos\theta_1 & \sin\theta_1\\ -\rho\sin\theta_1 & \rho\cos\theta_1\end{vmatrix}
=\rho\,(\cos^2\theta_1+\sin^2\theta_1)=\rho .$$


By the appropriate inductive reasoning,

$$\frac{\partial(z_1^n,\ldots,z_n^n)}{\partial(\rho,\theta_{n-1},\ldots,\theta_1)}
=\rho^{\,n-1}\sin^{n-2}\theta_{n-1}\,\sin^{n-3}\theta_{n-2}\cdots\sin\theta_2 .$$
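As a quick check on the recursion, the case n = 3 can be worked by hand. This is an illustrative aside, not part of the original derivation; it uses only the transformation and the two-dimensional Jacobian given above.

$$z_1^3=\rho\cos\theta_2,\qquad z_2^3=\rho\sin\theta_2\cos\theta_1,\qquad z_3^3=\rho\sin\theta_2\sin\theta_1,$$

$$\frac{\partial(z_1^3,z_2^3,z_3^3)}{\partial(\rho,\theta_2,\theta_1)}
=\rho\sin\theta_2\cdot\frac{\partial(z_1^2,z_2^2)}{\partial(\rho,\theta_1)}
=\rho\sin\theta_2\cdot\rho=\rho^2\sin\theta_2 ,$$

which is the familiar volume element of ordinary spherical coordinates with the polar angle measured from the $z_1$ axis.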


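The volume $V(\alpha,n)$ inside a sphere of radius r and inside a cone of angle $\alpha$ in an n-dimensional space is

$$V(\alpha,n)=\int_0^r\!\int_0^{\alpha}\!\int_0^{\pi}\!\cdots\!\int_0^{\pi}\!\int_0^{2\pi}
\rho^{\,n-1}\sin^{n-2}\theta_{n-1}\sin^{n-3}\theta_{n-2}\cdots\sin\theta_2\;
d\theta_1\,d\theta_2\cdots d\theta_{n-2}\,d\theta_{n-1}\,d\rho .$$

The surface area $S(\alpha,n)$ on the surface of a sphere of radius r and lying inside a cone of angle $\alpha$ is $\partial V(\alpha,n)/\partial r$, so that the proportion of the spherical surface subtended by a cone of angle $\alpha$ is

$$\frac{S(\alpha,n)}{S(\pi,n)}=
\frac{\displaystyle\int_0^{\alpha}\sin^{n-2}\theta_{n-1}\,d\theta_{n-1}}
{\displaystyle\int_0^{\pi}\sin^{n-2}\theta_{n-1}\,d\theta_{n-1}} .$$

This expression is the probability $P(\alpha,n)$ that the angle between two vectors from the center of the sphere to arbitrary points on the surface of the sphere is less than or equal to the angle $\alpha$ (when the probability density at all points of the surface is uniform).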
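The ratio of integrals can be evaluated numerically as a cross-check on any closed-form result. The following is a minimal sketch, assuming a composite Simpson rule is accurate enough for this smooth integrand; the function name `cone_fraction` and the step count are illustrative choices, not taken from the report.

```python
import math

def cone_fraction(alpha, n, steps=20000):
    """P(alpha, n): fraction of the sphere in n dimensions inside a cone of angle alpha."""
    m = n - 2  # exponent of the sine in the integrand

    def integral(upper):
        # Composite Simpson rule for the integral of sin^m from 0 to `upper`
        # (steps must be even).
        h = upper / steps
        total = math.sin(0.0) ** m + math.sin(upper) ** m
        for i in range(1, steps):
            total += (4 if i % 2 else 2) * math.sin(i * h) ** m
        return total * h / 3

    return integral(alpha) / integral(math.pi)

for degrees in (80, 85, 90, 95, 100):
    print(degrees, cone_fraction(math.radians(degrees), 155))
```

For n = 3 the function reduces to the elementary result (1 - cos α)/2, and for n = 155 nearly all of the probability lies within about ten degrees of 90°.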

The integrals may be evaluated in closed form using the two formulas

$$\int\sin^{2k}x\,dx=-\cos x\sum_{i=1}^{k}\left[\frac{\prod_{j=1}^{i-1}\bigl[1+2(k-j)\bigr]}{\prod_{j=1}^{i}2(k+1-j)}\right]\sin^{1+2(k-i)}x
+\left[\frac{\prod_{j=1}^{k}\bigl[1+2(k-j)\bigr]}{\prod_{j=1}^{k}2(k+1-j)}\right]x$$

$$\int\sin^{2k+1}x\,dx=-\cos x\sum_{i=1}^{k+1}\left[\frac{\prod_{j=1}^{i-1}2(k+1-j)}{\prod_{j=1}^{i}\bigl[1+2(k+1-j)\bigr]}\right]\sin^{2(k+1-i)}x$$


which are valid for all non-negative integers k. When the upper index in a summation sign or product sign is smaller than the corresponding lower index, the results are 0 or 1, respectively. These formulas were obtained by mathematical induction using the formula

$$\int\sin^{k}x\,dx=-\frac{\cos x\,\sin^{k-1}x}{k}+\frac{k-1}{k}\int\sin^{k-2}x\,dx ,$$

which can be derived using integration by parts. $P(\alpha,n)$ is a cumulative distribution function for $\alpha$, so that the probability density function is

$$\frac{\partial P(\alpha,n)}{\partial\alpha}=
\frac{\sin^{n-2}\alpha}{\displaystyle\int_0^{\pi}\sin^{n-2}\theta_{n-1}\,d\theta_{n-1}} .$$
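The reduction formula lends itself to a short recursive routine, which in turn gives the density without numerical integration. The sketch below is an illustration under the formulas stated above; the function names are hypothetical, and `even_closed_form` is included only to compare the even-power closed form against the recursion.

```python
import math

def sin_power_antiderivative(k, x):
    """An antiderivative of sin^k(x), built from the reduction formula."""
    if k == 0:
        return x
    if k == 1:
        return -math.cos(x)
    return (-math.cos(x) * math.sin(x) ** (k - 1) / k
            + (k - 1) / k * sin_power_antiderivative(k - 2, x))

def even_closed_form(k, x):
    """Closed form for an antiderivative of sin^(2k)(x); empty products equal 1."""
    total = 0.0
    for i in range(1, k + 1):
        num = math.prod(1 + 2 * (k - j) for j in range(1, i))
        den = math.prod(2 * (k + 1 - j) for j in range(1, i + 1))
        total += num / den * math.sin(x) ** (1 + 2 * (k - i))
    coeff = (math.prod(1 + 2 * (k - j) for j in range(1, k + 1))
             / math.prod(2 * (k + 1 - j) for j in range(1, k + 1)))
    return -math.cos(x) * total + coeff * x

def density_per_degree(alpha_deg, n):
    """Probability density of alpha, per degree, in an n-dimensional space."""
    m = n - 2
    norm = sin_power_antiderivative(m, math.pi) - sin_power_antiderivative(m, 0.0)
    return math.sin(math.radians(alpha_deg)) ** m / norm * math.pi / 180.0

print(density_per_degree(90.0, 155), density_per_degree(80.0, 155))
```

The recursion depth is about (n-2)/2, so n = 155 is well within ordinary limits.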


Figure III-11 diagrams the cumulative distribution function and probability density per degree for the angle $\alpha$ between $E_c$ and $(X-\bar{X})$ in a 155-dimensional constraint space. Note that the probability density function is strongly peaked near $\alpha = 90°$. As the dimension of the constraint space increases, the cumulative distribution function approaches a step function jumping from 0 to 1 at 90°. Under the same circumstances, consequently, the probability density function approaches a delta function centered about 90°.

5.  Probability Distribution for the Angle $\theta$ Between the Vectors $(X-\bar{X}) + E_c$ and $(X-\bar{X})$

It is possible to construct a two-dimensional plane passing through the origin, any specific vector $(X-\bar{X})$, and any specific vector $E_c$. The resulting plane must lie within the constraint space since the origin and the vectors $(X-\bar{X})$ and $E_c$ lie within the constraint space. Figure III-12 illustrates such a plane. In the figure, $(X-\bar{X})$ and $E_c$ are as defined previously in this section; $\alpha$ is the angle between the vectors $E_c$ and $(X-\bar{X})$, and $\theta$ is the angle between the vectors $(X-\bar{X})$ and $(X-\bar{X}) + E_c$.

If $E_c$ is not a scalar multiple of $(X-\bar{X})$, the angles $\alpha$ and $\theta$ are between 0 and $\pi$ and the sine law holds:

$$\frac{\sin(\alpha-\theta)}{\sin\theta}=\frac{|X-\bar{X}|}{|E_c|} .$$

FIGURE III-11

CUMULATIVE DISTRIBUTION FUNCTION AND PROBABILITY DENSITY PER DEGREE FOR THE ANGLE $\alpha$ BETWEEN $E_c$ AND $(X-\bar{X})$ IN A 155-DIMENSIONAL CONSTRAINT SPACE


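Division by $\sin\alpha$ yields the modified equation

$$\cot\theta-\cot\alpha=\frac{1}{\sin\alpha}\,\frac{|X-\bar{X}|}{|E_c|} .$$

Since the cotangent is a monotonically decreasing function over the range 0 to $\pi$, $\theta\le\theta_0$ if and only if $\cot\theta_0\le\cot\theta$. Thus $\theta\le\theta_0$ if and only if

$$\cot\theta_0-\cot\alpha\le\frac{1}{\sin\alpha}\,\frac{|X-\bar{X}|}{|E_c|} .$$

This condition is automatically satisfied if $\alpha\le\theta_0$, since $\cot\theta_0-\cot\alpha\le 0$ when $\alpha\le\theta_0$. Hence $\theta\le\theta_0$ if $\alpha\le\theta_0$ or if

$$|E_c|\le\frac{|X-\bar{X}|\,\sin\theta_0}{\sin(\alpha-\theta_0)}$$

when $\alpha>\theta_0$. It is not necessary to consider the case where $E_c$ is a scalar multiple of $(X-\bar{X})$, for $\alpha$ must then be either 0 or $\pi$: the probability that $\alpha\le 0$ or $\alpha\ge\pi$ is zero.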
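The geometry of this subsection is easy to check numerically in the two-dimensional plane. In the sketch below, which is illustrative only, $(X-\bar{X})$ is placed along the horizontal axis and $E_c$ makes the angle $\alpha$ with it; the function names and the sample values are assumptions, not data from the report.

```python
import math

def theta_angle(x_mag, ec_mag, alpha):
    """Angle between (X - Xbar) and (X - Xbar) + E_c, given the angle alpha
    between E_c and (X - Xbar)."""
    return math.atan2(ec_mag * math.sin(alpha), x_mag + ec_mag * math.cos(alpha))

def ec_limit(x_mag, theta0, alpha):
    """Largest |E_c| for which theta <= theta0 when alpha > theta0."""
    return x_mag * math.sin(theta0) / math.sin(alpha - theta0)

# Hypothetical sample values chosen only to exercise the formulas.
x_mag, alpha, theta0 = 100.0, math.radians(95.0), math.radians(3.0)
limit = ec_limit(x_mag, theta0, alpha)
theta = theta_angle(x_mag, limit, alpha)                 # on the boundary
law_of_sines = math.sin(alpha - theta) / math.sin(theta)  # compare with |X - Xbar| / |E_c|
print(math.degrees(theta), law_of_sines, x_mag / limit)
```

On the boundary the computed angle coincides with $\theta_0$, and any smaller $|E_c|$ gives a smaller angle.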


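Since all quantities in the inequality for $\alpha>\theta_0$ are non-negative, both sides of the inequality may be squared:

$$|E_c|^2\le\frac{|X-\bar{X}|^2\sin^2\theta_0}{\sin^2(\alpha-\theta_0)}\qquad(\alpha>\theta_0).$$

With a few more minor manipulations, the probability that $\theta\le\theta_0$ when $\alpha>\theta_0$ becomes the probability that

$$\frac{|E_c|^2-\mu_c}{\sigma_c}\le\frac{1}{\sigma_c}\left[\frac{|X-\bar{X}|^2\sin^2\theta_0}{\sin^2(\alpha-\theta_0)}-\mu_c\right],$$

where $\mu_c$ and $\sigma_c$ are the mean and standard deviation of the random variable $|E_c|^2$. Since $|E_c|^2$ is normally distributed, this probability is

$$F\!\left(\frac{1}{\sigma_c}\left[\frac{|X-\bar{X}|^2\sin^2\theta_0}{\sin^2(\alpha-\theta_0)}-\mu_c\right]\right),$$

where F is the cumulative normal distribution function

$$F(z)=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{z}e^{-y^2/2}\,dy .$$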


With the previous assumption that the angle $\alpha$ does not depend on the magnitude of $E_c$, the probability that $\theta\le\theta_0$ is

$$P(\theta\le\theta_0)=P(\alpha\le\theta_0)+\int_{\alpha=\theta_0}^{\alpha=\pi}
F\!\left(\frac{1}{\sigma_c}\left[\frac{|X-\bar{X}|^2\sin^2\theta_0}{\sin^2(\alpha-\theta_0)}-\mu_c\right]\right)p(\alpha)\,d\alpha ,$$

where $p(\alpha)$ is the probability density function for the angle $\alpha$. For values of $\theta_0$ such that $P(\alpha\le\theta_0)$ is negligible,

$$P(\theta\le\theta_0)\approx F\!\left(\frac{|X-\bar{X}|^2\tan^2\theta_0-\mu_c}{\sigma_c}\right)$$

is a good approximation to the desired probability since $p(\alpha)$ is strongly peaked at $\alpha=\pi/2$ when the dimension of the constraint space is high.
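The approximation amounts to one evaluation of the normal distribution function. The sketch below is illustrative: the mean 28.071 counts² and the squared magnitude 9693 counts² are the unscaled-data values quoted elsewhere in this section, while the standard deviation `SIGMA_C_ASSUMED` is a purely hypothetical placeholder, since the tabulated value is not reproduced here.

```python
import math

def normal_cdf(z):
    """Cumulative normal distribution function F(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_theta_below(theta0, x_sq, mu_c, sigma_c):
    """Approximate P(theta <= theta0) with the angle alpha fixed at pi/2."""
    return normal_cdf((x_sq * math.tan(theta0) ** 2 - mu_c) / sigma_c)

MU_C = 28.071           # mean of |E_c|^2 for unscaled data (counts^2)
X_SQ = 9693.0           # one value of |X - Xbar|^2 (counts^2)
SIGMA_C_ASSUMED = 7.0   # hypothetical standard deviation, for illustration only

for minutes in (150, 185, 220):
    theta0 = math.radians(minutes / 60.0)
    print(minutes, prob_theta_below(theta0, X_SQ, MU_C, SIGMA_C_ASSUMED))
```

Whatever standard deviation is used, the approximate probability passes through one half where $|X-\bar{X}|^2\tan^2\theta_0=\mu_c$.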

The statistics of $|E_c|^2$ are fixed for each of the three modes of processing under examination. Given any squared magnitude $|X-\bar{X}|^2$ of the vector $(X-\bar{X})$, a cumulative distribution function $P(\theta\le\theta_0)$ for the angle $\theta$ between the vectors $(X-\bar{X})$ and $(X-\bar{X}) + E_c$ can be computed. When such distribution functions are evaluated over a range of values of $|X-\bar{X}|^2$, the result is a probability which is a function of the variables $\theta_0$ and $|X-\bar{X}|^2$.

Figure III-13 is a perspective drawing of this probability over a rectangular grid of the values $\theta_0$ and $|X-\bar{X}|^2$. The probabilities are plotted for the case of unscaled data, data scaled by 16, and floating-point data. In each of the three cases, two drawings are plotted from slightly different viewpoints. The lower plateau in each drawing corresponds to a probability of zero, the upper plateau to a probability of one.


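By specifying a probability distribution for the squared magnitudes $|X-\bar{X}|^2$ of the vector $(X-\bar{X})$, the probability distribution for the angle $\theta$ can be found:

$$P(\theta\le\theta_0)=\int_0^{\infty}P\bigl(\theta\le\theta_0\,\big|\,|X-\bar{X}|^2\bigr)\,dP\bigl(|X-\bar{X}|^2\bigr).$$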


The values $|X-\bar{X}|^2$ computed during a four-hour noise sample from day 238 of 1970 (see Figure III-10) have been assumed to be equally likely. Figure III-14 shows the resulting probability density per degree for the angle $\theta$ between the vector $(X-\bar{X})$ and the projection of the vector $(X-\bar{X}) + E_c + E_b$ onto the constraint space. In the graph corresponding to each of the three modes of processing, there are minor bumps due to local concentrations or local gaps in the values $|X-\bar{X}|^2$.

If the squared magnitudes $|X-\bar{X}|^2$ and $|E_c|^2$ are replaced by their mean values and the angle $\alpha$ is assumed to be 90°, a useful estimate of the mean $\bar{\theta}$ of the angle $\theta$ is obtained:

$$\bar{\theta}\approx\tan^{-1}\sqrt{\frac{\mu_c}{E(|X-\bar{X}|^2)}} ,$$

where $E(|X-\bar{X}|^2)$ is the mean of the squared magnitude $|X-\bar{X}|^2$. For the noise sample from day 238, the mean of $|X-\bar{X}|^2$ was 9693 counts² for unscaled data, 2,481,422 counts² for data scaled by a factor of 16. Thus the estimated mean of $\theta$ is

$$\tan^{-1}\sqrt{\frac{28.071}{9693}}=3°\,5' ,$$

$$\tan^{-1}\sqrt{\frac{1081.952}{2481422}}=1°\,12' ,$$

and

$$\tan^{-1}\sqrt{\frac{4.133}{9693}}=1°\,11' ,$$

respectively, for unscaled data, data scaled by 16, and floating-point data. These estimates are close to the probability density peaks of Figure III-14.

FIGURE III-13

CUMULATIVE DISTRIBUTION FUNCTION $P(\theta\le\theta_0)$ OVER A RANGE OF THE SQUARED MAGNITUDE $|X-\bar{X}|^2$: (a) Unscaled Data, (b) Data Scaled by 16, (c) Floating-Point Data


6.  Probability Distribution for the Angle $\phi$ Between the Vectors $(X-\bar{X}) + E_c + E_b$ and $(X-\bar{X})$

As before, it is possible to construct a two-dimensional plane that passes through the origin, any one of the vectors $(X-\bar{X})$, and any one of the vectors $(X-\bar{X}) + E_c + E_b$. Such a plane is shown in Figure III-15. In the figure, $\phi$ is the angle between the vectors $(X-\bar{X}) + E_c + E_b$ and $(X-\bar{X})$ and $E_b$ is the beamsteer roundoff error vector. $(X-\bar{X})$, $E_c$, and $\alpha$ are as defined previously. The line running from the origin to $(X-\bar{X})$ has been extended to the point where $(X-\bar{X}) + E_c + E_b$ projects onto it in order to form a right triangle. Since $E_b$ is perpendicular to any vector within the constraint space, the leg of the right triangle adjacent to the angle $\phi$ is of length $|X-\bar{X}| + |E_c|\cos\alpha$. The side opposite the angle $\phi$ consists of the sum of the vector $E_b$ and a vector of length $|E_c|\sin\alpha$ (within the constraint space and perpendicular to $X-\bar{X}$). Since $E_b$ is perpendicular to the constraint space, the length of the side opposite $\phi$ is $\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}$. Thus the cotangent of $\phi$ is

$$\cot\phi=\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}} .$$

FIGURE III-15

TWO-DIMENSIONAL PLANE PASSING THROUGH THE ORIGIN, $(X-\bar{X}) + E_c + E_b$ AND $(X-\bar{X})$


Since the cotangent decreases monotonically in $(0,\pi)$, the angle $\phi$ is less than or equal to any specific angle $\phi_0$ within the interval $(0,\pi)$ if and only if $\cot\phi_0\le\cot\phi$. Hence $\phi\le\phi_0$ if and only if

$$\cot\phi_0\le\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}} .$$

If $\phi=\phi_0$, $\cot\phi_0=\cot\phi$ implies that

$$|E_c|^2\sin^2\alpha+|E_b|^2=\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0 .$$

This equation is consistent with the conic section corresponding to the intersection of a cone of angle $\phi_0$ and a cutting plane perpendicular to the constraint space. The cone of angle $\phi_0$ has its vertex at the origin and is centered about an axis through the origin and $(X-\bar{X})$. The cutting plane is determined by the three points $(X-\bar{X})$, $(X-\bar{X}) + E_c$, and $(X-\bar{X}) + E_c + E_b$. For a given value of $|X-\bar{X}|$ and a given angle $\alpha$, the region where $\phi\le\phi_0$ can be expressed as an inequality involving the two variables $|E_c|$ and $|E_b|$. In fact, if $(X-\bar{X})$ is selected as the origin of a coordinate system within the cutting plane, the direction of the vectors $E_c$ and $E_b$ can be used to specify the directions of the abscissa and ordinate, respectively, in a two-dimensional Cartesian system with perpendicular axes. For example, if $\phi_0<\pi/2=\alpha$, the region where $\phi\le\phi_0$ can be expressed in terms of the inequality

$$|E_c|^2+|E_b|^2\le|X-\bar{X}|^2\tan^2\phi_0 ,$$


which corresponds to the interior of a circle of radius $|X-\bar{X}|\tan\phi_0$ with its center at the origin $(X-\bar{X})$ of the cutting-plane coordinate system. Since the axis directions are determined by $E_c$ and $E_b$, the coordinates are $(|E_c|, |E_b|)$ for any vector pair $(E_c, E_b)$. Thus all points lie within the first quadrant of the new coordinate system for the cutting plane.

It is easier to define the region where $\phi\le\phi_0$ with inequalities which express $|E_b|$ in terms of $|E_c|$. Since the results obtained in this way will help to specify $|E_c|$ in terms of $|E_b|$, they will be derived first.

If $\phi_0=0$, $\phi\le\phi_0$ implies that $\phi=\phi_0=0$ and

$$\tan\phi=0=\frac{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}{|X-\bar{X}|+|E_c|\cos\alpha} .$$

Thus $|E_b|^2$ and $|E_c|^2\sin^2\alpha$ must be zero. If $0<\alpha<\pi$, only the single point $(|E_c|, |E_b|) = (0, 0)$ satisfies the requirement that $\phi\le\phi_0$. If $\alpha=0$, $|E_b|$ must be zero and all values of $|E_c|$ must lie on the semi-infinite line $0\le|E_c|<\infty$. If $\alpha=\pi$, $|E_b|$ must again be zero and all values of $|E_c|$ must lie on the line segment $0\le|E_c|<|X-\bar{X}|$. In this case, the angle $\phi$ is undefined when $|E_c|=|X-\bar{X}|$.

If $0<\phi_0<\pi/2$ and $\alpha\le\phi_0$,

$$\tan^2\phi_0\Bigl[|X-\bar{X}|^2+2\,|X-\bar{X}|\,|E_c|\cos\alpha+|E_c|^2\sin^2\alpha\,(\cot^2\alpha-\cot^2\phi_0)\Bigr]
=\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha\ \ge\ 0 .$$

The required inequality

$$\cot\phi_0=\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)\tan\phi_0}
\le\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

is satisfied if and only if

$$0\le|E_b|^2\le\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha .$$
If $0<\phi_0<\pi/2$ and $\alpha>\phi_0$, $\phi$ is always greater than $\phi_0$ if $|E_c|>|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$. The following inequalities are valid if $\sin\alpha>0$:

$$|E_c|\sin\alpha>\frac{|X-\bar{X}|\sin\alpha\,\sin\phi_0}{\sin(\alpha-\phi_0)}=\frac{|X-\bar{X}|}{\cot\phi_0-\cot\alpha} ,$$

$$\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}\;\cot\phi_0\ \ge\ |E_c|\sin\alpha\cot\phi_0\ >\ |X-\bar{X}|+|E_c|\cos\alpha .$$

Now if $|E_c|\le|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$,

$$0\le|E_c|\sin\alpha\le\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)\tan\phi_0$$

and

$$\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha\ge 0 .$$

Accordingly, when $|E_c|\le|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$,

$$\cot\phi_0=\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)\tan\phi_0}
\le\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

if and only if

$$|E_b|\le\sqrt{\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha} .$$

When $0<\phi_0<\pi/2$ and $\alpha>\phi_0$, therefore, $\phi\le\phi_0$ if and only if both of the conditions

$$|E_c|\le\frac{|X-\bar{X}|\sin\phi_0}{\sin(\alpha-\phi_0)}$$

$$0\le|E_b|^2\le\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha$$

are met.


If $\phi_0=\pi/2$,

$$0=\cot\phi_0\le\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

if and only if

$$|X-\bar{X}|+|E_c|\cos\alpha\ge 0 .$$

If $\alpha\le\pi/2$, the condition is always satisfied. If $\alpha>\pi/2$, $\phi\le\phi_0$ if and only if

$$|E_c|\le\frac{|X-\bar{X}|}{-\cos\alpha}=\frac{|X-\bar{X}|\sin\phi_0}{\sin(\alpha-\phi_0)} .$$


If $\pi/2<\phi_0<\pi$ and $\alpha\le\phi_0$, $\phi$ is never greater than $\phi_0$. Either $\alpha\le\pi/2$ and $\cot\alpha\ge 0$ so that

$$\cot\phi_0-\frac{|E_c|\sin\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}\cot\alpha<0<\frac{|X-\bar{X}|}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

and

$$\cot\phi_0<\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}} ,$$

or $\alpha>\pi/2$ and $\cot\alpha<0$ so that

$$\cot\phi_0+\frac{|E_c|\sin\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}\,(-\cot\alpha)\le\cot\phi_0-\cot\alpha\le 0\le\frac{|X-\bar{X}|}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

and

$$\cot\phi_0\le\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}} .$$

Finally, if $\pi/2<\phi_0<\pi$ and $\alpha>\phi_0$, $\phi$ is never greater than $\phi_0$ if $|E_c|\le|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$. The following conditions are satisfied:

$$|E_c|\sin\alpha\le\frac{|X-\bar{X}|\sin\alpha\,\sin\phi_0}{\sin(\alpha-\phi_0)}=\frac{|X-\bar{X}|}{\cot\phi_0-\cot\alpha} ,$$

$$\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}\;\cot\phi_0\le|E_c|\sin\alpha\cot\phi_0\le|X-\bar{X}|+|E_c|\cos\alpha .$$

On the other hand, if $|E_c|>|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$, then $\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha>0$ and

$$-\cot\phi_0=\frac{-\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)}{\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)\tan\phi_0}
\ \ge\ \frac{-\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}$$

if and only if

$$|E_b|^2\ge\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2\tan^2\phi_0-|E_c|^2\sin^2\alpha .$$


Since $|E_b|^2$ has a discrete probability distribution and $|E_c|^2$ is normally distributed, it is easier to compute the probability that $\phi\le\phi_0$ when $|E_c|$ is given in terms of $|E_b|$. For each value of $|E_b|$, there exists a range of $|E_c|$ values such that $\phi\le\phi_0$. The probability of this range of $|E_c|$ values is the difference between two cumulative normal distribution functions.
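The "difference between two cumulative normal distribution functions" can be written down directly. The sketch below is illustrative: it assumes, as the text does, that $|E_c|^2$ is normal with mean $\mu_c$ and standard deviation $\sigma_c$, so that limits on $|E_c|$ are squared before standardizing. The function names and the numerical value of the standard deviation are hypothetical.

```python
import math

def normal_cdf(z):
    """Cumulative normal distribution function F(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_ec_between(lower, upper, mu_c, sigma_c):
    """P(lower <= |E_c| <= upper) when |E_c|^2 is Normal(mu_c, sigma_c^2)."""
    return (normal_cdf((upper ** 2 - mu_c) / sigma_c)
            - normal_cdf((lower ** 2 - mu_c) / sigma_c))

# mu_c for unscaled data as quoted in this section; sigma_c is a hypothetical placeholder.
mu_c, sigma_c = 28.071, 7.0
print(prob_ec_between(4.0, 6.0, mu_c, sigma_c))
```

Summing such terms over the discrete values of $|E_b|$, each weighted by its probability mass, gives the probability for one angle $\alpha$.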


To specify the boundary of the region where $\phi\le\phi_0$, the quadratic equation

$$\bigl(|E_c|^2\sin^2\alpha+|E_b|^2\bigr)\cot^2\phi_0=\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2$$

is used to solve for $|E_c|$ in terms of $|E_b|$. Ordinarily only one solution for $|E_c|$ is the correct definition of the boundary. Since the conditions for $\phi_0=0$ have negligible probability ($|E_b|=0$) and since $\phi\le\phi_0$ whenever $\pi/2<\phi_0$ and $\alpha\le\phi_0$, only the cases $0<\phi_0\le\pi/2$ and $\pi/2<\phi_0<\alpha$ will be discussed.

°  o 

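When $0<\phi_0<\pi/2$ and $\alpha\le\phi_0$, the inequality $|E_b|\le|X-\bar{X}|\tan\phi_0$ guarantees that

$$\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)\tan\phi_0\ge\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}$$

and hence that $\phi\le\phi_0$:

$$\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}\le\sqrt{|E_c|^2\cos^2\alpha\,\tan^2\phi_0+|X-\bar{X}|^2\tan^2\phi_0}
=\tan\phi_0\sqrt{|E_c|^2\cos^2\alpha+|X-\bar{X}|^2}\le\tan\phi_0\,\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr).$$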


If $|E_b|>|X-\bar{X}|\tan\phi_0$ when $0<\phi_0<\pi/2$ and $\alpha\le\phi_0$, $\phi\le\phi_0$ if and only if $|E_c|$ satisfies the linear inequality

$$|E_c|\tan\phi_0\ge|E_b|-|X-\bar{X}|\tan\phi_0$$

when $\alpha=0$, the hyperbolic inequality

$$|E_c|\sin\alpha\ge\frac{\cot\phi_0\sqrt{|X-\bar{X}|^2+|E_b|^2(\cot^2\alpha-\cot^2\phi_0)}-|X-\bar{X}|\cot\alpha}{\cot^2\alpha-\cot^2\phi_0}$$

when $0<\alpha<\phi_0$, or the parabolic inequality

$$|E_c|\cos\alpha\ge\frac{|E_b|^2\cot^2\phi_0-|X-\bar{X}|^2}{2\,|X-\bar{X}|}$$

when $\alpha=\phi_0$.


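If $0<\phi_0<\pi/2$ and $\alpha>\phi_0$, $|E_c|$ must be no larger than $|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$ in order that $\phi\le\phi_0$ (as was shown earlier). The most difficult case occurs when $\phi_0<\alpha<\pi/2$. Whenever $|E_b|^2>|X-\bar{X}|^2/(\cot^2\phi_0-\cot^2\alpha)$, $\phi$ is always greater than $\phi_0$:

$$
\begin{aligned}
\bigl(|E_c|^2\sin^2\alpha+|E_b|^2\bigr)\cot^2\phi_0
&>\left[|E_c|^2\sin^2\alpha+\frac{|X-\bar{X}|^2}{\cot^2\phi_0-\cot^2\alpha}\right]\cot^2\phi_0\\
&=|E_c|^2\cos^2\alpha+|E_c|^2\sin^2\alpha\,(\cot^2\phi_0-\cot^2\alpha)+\frac{|X-\bar{X}|^2\cot^2\alpha}{\cot^2\phi_0-\cot^2\alpha}+|X-\bar{X}|^2\\
&\ge|E_c|^2\cos^2\alpha+2\,|E_c|\cos\alpha\,|X-\bar{X}|+|X-\bar{X}|^2=\bigl(|X-\bar{X}|+|E_c|\cos\alpha\bigr)^2
\end{aligned}
$$

since

$$
0\le\frac{\bigl[(\cot^2\phi_0-\cot^2\alpha)\,|E_c|\sin\alpha-|X-\bar{X}|\cot\alpha\bigr]^2}{\cot^2\phi_0-\cot^2\alpha}
=|E_c|^2\sin^2\alpha\,(\cot^2\phi_0-\cot^2\alpha)-2\,|E_c|\cos\alpha\,|X-\bar{X}|+\frac{|X-\bar{X}|^2\cot^2\alpha}{\cot^2\phi_0-\cot^2\alpha} .
$$

Thus $\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}\,\cot\phi_0>|X-\bar{X}|+|E_c|\cos\alpha$ and $\phi>\phi_0$. For the range $0\le|E_b|^2\le|X-\bar{X}|^2\tan^2\phi_0$, $\phi\le\phi_0$ if and only if

$$0\le|E_c|\sin\alpha\le\frac{|X-\bar{X}|\cot\alpha+\cot\phi_0\sqrt{|X-\bar{X}|^2-|E_b|^2(\cot^2\phi_0-\cot^2\alpha)}}{\cot^2\phi_0-\cot^2\alpha}$$

when $\phi_0<\alpha<\pi/2$. For the range

$$|X-\bar{X}|^2\tan^2\phi_0<|E_b|^2\le\frac{|X-\bar{X}|^2}{\cot^2\phi_0-\cot^2\alpha} ,$$

both of the inequalities

$$\frac{|X-\bar{X}|\cot\alpha-\cot\phi_0\sqrt{|X-\bar{X}|^2-|E_b|^2(\cot^2\phi_0-\cot^2\alpha)}}{\cot^2\phi_0-\cot^2\alpha}\le|E_c|\sin\alpha$$

$$|E_c|\sin\alpha\le\frac{|X-\bar{X}|\cot\alpha+\cot\phi_0\sqrt{|X-\bar{X}|^2-|E_b|^2(\cot^2\phi_0-\cot^2\alpha)}}{\cot^2\phi_0-\cot^2\alpha}$$

must be satisfied if $\phi\le\phi_0$. The region $\phi\le\phi_0$ forms an ellipse in the cutting plane when $0<\phi_0<\alpha<\pi/2$.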

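If $0<\phi_0<\pi/2\le\alpha$, $\phi$ is always greater than $\phi_0$ whenever $|E_b|>|X-\bar{X}|\tan\phi_0$:

$$\cot\phi_0\ge\frac{|X-\bar{X}|}{\sqrt{|E_c|^2\sin^2\alpha+|X-\bar{X}|^2\tan^2\phi_0}}
>\frac{|X-\bar{X}|}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}}
\ge\frac{|X-\bar{X}|+|E_c|\cos\alpha}{\sqrt{|E_c|^2\sin^2\alpha+|E_b|^2}} .$$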


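If $|E_b|\le|X-\bar{X}|\tan\phi_0$ when $0<\phi_0<\pi/2\le\alpha$, $\phi\le\phi_0$ if and only if $|E_c|$ satisfies the circular inequality

$$|E_c|^2\le|X-\bar{X}|^2\tan^2\phi_0-|E_b|^2$$

when $\alpha=\pi/2$, the elliptic inequality

$$|E_c|\sin\alpha\le\frac{\cot\phi_0\sqrt{|X-\bar{X}|^2-|E_b|^2(\cot^2\phi_0-\cot^2\alpha)}+|X-\bar{X}|\cot\alpha}{\cot^2\phi_0-\cot^2\alpha}$$

when $\pi/2<\alpha<\pi-\phi_0$, the parabolic inequality

$$|E_c|\,(-\cos\alpha)\le\frac{|X-\bar{X}|^2-|E_b|^2\cot^2\phi_0}{2\,|X-\bar{X}|}$$

when $\alpha=\pi-\phi_0$, the hyperbolic inequality

$$|E_c|\sin\alpha\le-\,\frac{|X-\bar{X}|\cot\alpha+\cot\phi_0\sqrt{|X-\bar{X}|^2+|E_b|^2(\cot^2\alpha-\cot^2\phi_0)}}{\cot^2\alpha-\cot^2\phi_0}$$

when $\pi-\phi_0<\alpha<\pi$, or the linear inequality

$$|E_c|\tan\phi_0\le|X-\bar{X}|\tan\phi_0-|E_b|$$

when $\alpha=\pi$.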

It was shown earlier that $\phi\le\phi_0$ if and only if

$$|E_c|\,(-\cos\alpha)\le|X-\bar{X}|$$

when $\pi/2=\phi_0<\alpha$.

If $\pi/2<\phi_0<\alpha$, $\phi\le\phi_0$ if and only if $|E_c|$ satisfies the hyperbolic inequality

$$|E_c|\sin\alpha\le-\,\frac{|X-\bar{X}|\cot\alpha+\cot\phi_0\sqrt{|X-\bar{X}|^2+|E_b|^2(\cot^2\alpha-\cot^2\phi_0)}}{\cot^2\alpha-\cot^2\phi_0}$$

when $\alpha<\pi$, or the linear inequality

$$|E_c|\le|X-\bar{X}|-|E_b|\cot\phi_0$$

when $\alpha=\pi$. These last two inequalities are automatically satisfied when $|E_c|\le|X-\bar{X}|\sin\phi_0/\sin(\alpha-\phi_0)$.
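The boundary formulas can be spot-checked against the defining geometry. The sketch below is an illustration under the right-triangle construction of this subsection: it computes $\phi$ directly from the adjacent and opposite sides, then confirms that a point on the hyperbolic boundary (case $0<\alpha<\phi_0<\pi/2$ with $|E_b|>|X-\bar{X}|\tan\phi_0$) has $\phi=\phi_0$. All names and sample values are hypothetical.

```python
import math

def phi_angle(x_mag, ec_mag, eb_mag, alpha):
    """Angle between (X - Xbar) + E_c + E_b and (X - Xbar)."""
    opposite = math.hypot(ec_mag * math.sin(alpha), eb_mag)
    adjacent = x_mag + ec_mag * math.cos(alpha)
    return math.atan2(opposite, adjacent)

def hyperbolic_boundary(x_mag, eb_mag, alpha, phi0):
    """|E_c| on the boundary phi = phi0 for 0 < alpha < phi0 < pi/2
    and |E_b| > |X - Xbar| tan(phi0)."""
    cot_a, cot_0 = 1.0 / math.tan(alpha), 1.0 / math.tan(phi0)
    diff = cot_a ** 2 - cot_0 ** 2
    u = (cot_0 * math.sqrt(x_mag ** 2 + eb_mag ** 2 * diff) - x_mag * cot_a) / diff
    return u / math.sin(alpha)  # u is |E_c| sin(alpha)

# Hypothetical normalized values (|X - Xbar| = 1).
x_mag, eb_mag = 1.0, 0.5
alpha, phi0 = math.radians(10.0), math.radians(20.0)
ec_mag = hyperbolic_boundary(x_mag, eb_mag, alpha, phi0)
print(ec_mag, math.degrees(phi_angle(x_mag, ec_mag, eb_mag, alpha)))
```

Values of $|E_c|$ above the boundary give $\phi<\phi_0$ in this case, as the hyperbolic inequality requires; the same direct computation can be used to check the circular case $\alpha=\pi/2$.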


Figure III-16 shows the region in the cutting plane where $\phi\le\phi_0$ for a variety of angles $\alpha$ and $\phi_0$. The normalized variables $C=|E_c|/|X-\bar{X}|$ and $B=|E_b|/|X-\bar{X}|$ replace the variables $|E_c|$ and $|E_b|$, respectively.

The most easily obtained information about the probability distribution of $\phi$ is an estimate of its mean $\bar{\phi}$. If the squared magnitudes $|X-\bar{X}|^2$, $|E_c|^2$, and $|E_b|^2$ are replaced by their mean values and the angle $\alpha$ is assumed to be 90°,

$$\bar{\phi}\approx\tan^{-1}\sqrt{\frac{\mu_c+\mu_b}{E(|X-\bar{X}|^2)}} ,$$

where $E(|X-\bar{X}|^2)$ is the mean of the squared magnitude $|X-\bar{X}|^2$, $\mu_c$ is the mean of $|E_c|^2$, and $\mu_b$ is the mean of $|E_b|^2$. For the noise sample from day 238 of 1970 the estimated mean of $\phi$ is

$$\tan^{-1}\sqrt{\frac{28.071+16.361}{9693}}=3°\,52' ,$$

$$\tan^{-1}\sqrt{\frac{1081.952+16.361}{2481422}}=1°\,12' ,$$

$$\tan^{-1}\sqrt{\frac{4.133+0.0}{9693}}=1°\,11' ,$$

respectively, for unscaled data, data scaled by 16, and floating-point data. The estimate of the mean for the angle of error in $(X-\bar{X})$ is 47' greater for unscaled data, less than 1' greater for data scaled by 16, and, of course, exactly the same for floating-point data when the effect of the beamsteer roundoff error is considered.
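These estimates are simple enough to reproduce with a few lines of arithmetic. The sketch below uses only the means quoted in the text; the function name `mean_angle_arcmin` and the dictionary layout are illustrative choices.

```python
import math

def mean_angle_arcmin(error_power, signal_power):
    """Estimated mean angle in minutes of arc: atan(sqrt(error / signal))."""
    return math.degrees(math.atan(math.sqrt(error_power / signal_power))) * 60.0

MU_B = 16.361  # mean of |E_b|^2 with integer arithmetic (counts^2)
CASES = {      # name: (mu_c, E|X - Xbar|^2, mu_b), all in squared counts
    "unscaled": (28.071, 9693.0, MU_B),
    "scaled by 16": (1081.952, 2481422.0, MU_B),
    "floating point": (4.133, 9693.0, 0.0),
}

for name, (mu_c, x_sq, mu_b) in CASES.items():
    theta = mean_angle_arcmin(mu_c, x_sq)          # error within the constraint space only
    phi = mean_angle_arcmin(mu_c + mu_b, x_sq)     # including beamsteer roundoff error
    print(f"{name}: theta ~ {theta:.1f}', phi ~ {phi:.1f}', increase {phi - theta:.1f}'")
```

To the nearest minute the results agree with the angles quoted above; the unrounded increase for unscaled data is slightly above 47'.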

The associated equation

$$E(|X-\bar{X}|^2)\approx\frac{\mu_c+\mu_b}{\tan^2\bar{\phi}}$$

can be used to select analog-to-digital conversion factors and computer data scale factors which achieve a desired angle of error $\bar{\phi}$. If the A/D system is fixed, for example, the minimum achievable angular error can be determined by considering only the digitization errors. This minimum achievable error can be approached arbitrarily closely through the choice of a suitably high data scale factor. As the data scale factor increases, so does the term $E(|X-\bar{X}|^2)$ when expressed in squared computer counts. In turn, the computer-count value of $E(|X-\bar{X}|^2)$ is almost sufficient to express the computer-count RMS level for the data points $x_i(t-j)$ and hence the number of bits required to express it. The dynamic range of the data is a separate problem, however. The number of guard bits needed to reduce the frequency of data clipping to acceptable levels is a function of the fluctuations above the data RMS level. If both the A/D system and computer data scaling are subject to control, a wider range of A/D conversion factors and computational scale factors is possible. For a given desired angle $\bar{\phi}$, in this event, there will be a curve relating the A/D conversion factor to the computer data scale factor.
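Used in the design direction, the associated equation gives the mean squared magnitude needed for a target angle of error. The sketch below simply inverts the estimate; it treats $\mu_c$ and $\mu_b$ as fixed inputs, which is an assumption, since in practice $\mu_c$ expressed in counts also changes with the scale factor. The function name is hypothetical.

```python
import math

def required_mean_square(mu_c, mu_b, target_angle_deg):
    """E(|X - Xbar|^2) for which the estimated mean angle equals the target."""
    return (mu_c + mu_b) / math.tan(math.radians(target_angle_deg)) ** 2

# Round trip with the unscaled-data means quoted above: a target near 3 deg 52'
# should recover a mean squared magnitude close to 9693 counts^2.
print(required_mean_square(28.071, 16.361, 3.0 + 52.4 / 60.0))
```

Halving the target angle roughly quadruples the required mean squared magnitude, which corresponds to about one additional bit in RMS data level.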

The more useful approximation

$$P(\phi\le\phi_0)\approx P\bigl(|E_c|^2+|E_b|^2\le|X-\bar{X}|^2\tan^2\phi_0\bigr)$$

is based on the assumption that the angle $\alpha$ is always $\pi/2$. Since the probability density for $\alpha$ is strongly and symmetrically concentrated about a value of $\pi/2$ and since the probability that $\phi\le\phi_0$ is quasi-linear in the neighborhood of $\alpha=\pi/2$, the approximation is remarkably accurate. Along the curve $|X-\bar{X}|^2\tan^2\phi_0=\mu_c$ (where the estimate is 1/2 when $|E_b|^2=0$), the actual values ranged from 0.50 at $\alpha=\pi/2$ to less than 0.55 at the highest $|X-\bar{X}|^2$ value. In most cases the approximation underestimates $P(\phi\le\phi_0)$ and hence tends to overestimate the angle of error. By specifying a probability distribution for the squared magnitudes $|X-\bar{X}|^2$, the probability distribution for the angle $\phi$ can be approximated:

$$P(\phi\le\phi_0)\approx\int_0^{\infty}P\bigl(|E_c|^2+|E_b|^2\le|X-\bar{X}|^2\tan^2\phi_0\bigr)\,dP\bigl(|X-\bar{X}|^2\bigr).$$


Once again, as in the case of the probability density functions graphed in Figure III-14, the values $|X-\bar{X}|^2$ from a four-hour noise sample from day 238 of 1970 have been assumed equally likely. Figure III-17 is a plot of the corresponding probability density per degree for the angle of error $\phi$. Some idea of the accuracy of the approximation can be gleaned from a comparison of Figures III-14 (c) and III-17 (c). In the case of floating-point data, $|E_b|^2=0$ and the two underlying probability distributions are identical; the approximation has shifted the twin peaks at 1° 7.5' and 1° 17.25' to 1° 7.8' and 1° 17.55' (only 0.3' error). In the case of data scaled by 16, the twin peaks in Figure III-14 (b) are 0.9' higher than for the floating-point data. After allowing for error perpendicular to the constraint space, they are now 1.2' to 1.5' higher in Figure III-17 (b) than the peaks for floating-point data. With unscaled data, however, noticeable increases are visible in Figure III-17 (a) as compared with Figure III-14 (a). The first peak has moved 44.25' from 2° 55.5' to 3° 39.75', the second 53.25' from 3° 21.375' to 4° 14.625'. These changes (due to beamsteer roundoff error) are in rough agreement with the 47' increase predicted by the approximations for $\bar{\theta}$ and $\bar{\phi}$.


If the complete calculation is to be performed, probabilities must be
computed over a range of $|\bar{X} - X|^2$ and $\phi_0$ values. To obtain one of these
probabilities, a numerical integration over the interval $0 < \alpha < \pi$ is necessary.
And, finally, for any particular angle $\alpha$, a summation of cumulative normal
distribution functions (each term corresponding to one discrete value of $|E_b|^2$)
must be carried out. The necessary mathematical relationships have been detailed
in this part of subsection C. Although it is possible in principle to accomplish
the required computations, they have not been done because of the
computer time involved. It is necessary to be content with the results displayed
in Figure III-17.


FIGURE III-17

PROBABILITY DENSITY PER DEGREE FOR THE ANGLE BETWEEN
THE VECTOR $(\bar{X} - X)$ AND THE VECTOR $(\bar{X} - X) + E_c + E_b$

(a) Unscaled Data; (b) Data Scaled by 16; (c) Floating-Point Data.
Horizontal axis: PHI. Vertical axis: probability density per degree.


At this point, it is worth mentioning that the beamsteer roundoff error
can be eliminated by multiplying each component $\bar{x}(t-j) - x_i(t-j)$ in the vector
$(\bar{X} - X)$ by $M$, the number of channels:

$$M\left[\bar{x}(t-j) - x_i(t-j)\right] = \sum_{k=1}^{M} x_k(t-j) - M x_i(t-j).$$

In calculating the filter change vector $(A^{new} - A^{old})$, a compensatory scaling
by $1/M$ can be executed. In the scaled vector $M(\bar{X} - X)$, all the quantities involved
are integers. The result is an integer, and no roundoff error is produced
in the scaled beamsteer output $M\bar{x}(t-j)$. If no overflow occurs, the scaled
vector $M(\bar{X} - X)$ satisfies the constraint conditions exactly. The eradication of
the beamsteer roundoff entails one multiplication for each point $x_i(t-j)$ used
in the adaptive filter update. If the values $M x_i(t-j)$ are stored for use in subsequent
updates, only $M$ multiplications per sample period are required.
Even so, however, the additional computations may preclude implementation
of this procedure.
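The integer scaling described above can be illustrated with a minimal sketch. The function names and the six sample values are hypothetical, chosen only to show that the scaled differences sum to zero exactly (the constraint condition for one lag), whereas differences taken from a rounded beamsteer value generally do not.

```python
def scaled_difference(samples):
    """Return M*(xbar - x_i) for integer samples, using integer arithmetic only."""
    m = len(samples)
    total = sum(samples)              # equals M * xbar, an exact integer
    return [total - m * x for x in samples]


def rounded_difference(samples):
    """Return round(xbar) - x_i, which carries the beamsteer roundoff error."""
    m = len(samples)
    xbar_rounded = round(sum(samples) / m)
    return [xbar_rounded - x for x in samples]


# Hypothetical integer counts on M = 6 channels at one lag (not from the report).
SAMPLES = [3, -2, 7, 0, 1, 4]

exact = scaled_difference(SAMPLES)
rounded = rounded_difference(SAMPLES)

print("scaled differences :", exact, "sum =", sum(exact))
print("rounded differences:", rounded, "sum =", sum(rounded))
```

A nonzero sum in the second line is the component of the update that leaves the constraint space; the scaled form avoids it at the cost of the extra multiplications mentioned in the text.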

The discussion near the beginning of subsection B mentioned that noise
reduction for one data sample was increased from 2 dB to 6 dB (relative to the
beamsteer output) simply by scaling the data points by a factor of 16. In view
of the relatively small angle of error in the vector $\bar{X} - X$ (even for unscaled
data), the principal source of error probably occurs elsewhere.

D.  ERROR IN THE FILTER OUTPUT $X^T A = y(t)$

In this subsection, the effects on the adaptive filter output $y(t)$ of
error in the data vector $X$ are studied. One effect is that the sign of the
adaptive filter output may change so that the filter step

$$A^{new} - A^{old} = \frac{2 K_s\, y(t)\, (\bar{X} - X)}{(\bar{X} - X)^T (\bar{X} - X)}$$

is in the wrong direction. As mentioned in subsection A, the effect of reversing
the direction of the filter update vector is the precise opposite of
what adaptive filtering sets out to accomplish: instead of reducing the mean
square filter output $\overline{y^2(t)}$, the sign reversal causes the filter vector to move
in a direction which tends to increase the average squared filter output. A
second effect is that the filter vector $A$ does not move if the dot product
$(X + E_c)^T A$ lies between -1/2 and 1/2 and is consequently rounded to zero
when integer arithmetic is used. (This effect does not take place when floating-point
arithmetic is employed.) When the filter update vector cannot move, the
adaptive-filtering process is delayed until the next non-zero value $y(t)$ of the
filter output. Over a large number of iterations, the result is an effective reduction
in the convergence rate through neglect of those times $t$ where the
adaptive filter output $y(t)$ is zero. These first two phenomena are discussed
in part 1. A completely different effect is that the filter output power is altered.
A method to calculate the modified filter output power from the original
crosscorrelation matrix (without digitization and computational error) and the
probability distributions for the error is derived in part 2.

1.  Probability  That  y(t)  Changes  Sign  or  Rounds  To  Zero 

In order to calculate the probability that the filter output $y(t)$ changes
sign or rounds to zero after adding the error vector $E_c$ to the data vector $X$,
the probability distribution for the error in $y(t)$

$$\epsilon_y(t) = \sum_{j=-N}^{N} \sum_{i=1}^{M} a_i(j)\, e_i(t-j)$$

before roundoff must be determined. Since the probability density function
for the roundoff error $e_i(t-j)$ in any single component of $E_c$ is symmetric
about zero, the probability distribution for the error $\epsilon_y(t)$ in $y(t)$ before
roundoff is the same as for the random variable

$$\sum_{j=-N}^{N} \sum_{i=1}^{M} |a_i(j)|\, e_i(t-j).$$

The probability density function for this random variable is the convolution of
the probability density functions for the $M(2N+1)$ random variables
$|a_i(j)|\, e_i(t-j)$. The variance of $\epsilon_y(t)$ is least when the adaptive filter output
is the beamsteer output

$$\bar{x}(t) = \frac{1}{M} \sum_{i=1}^{M} x_i(t)$$

and greatest when each filter weight $a_i(j)$ assumes its maximum absolute
value. In the adaptive filtering program used for this report, the filter
weights clip when their absolute value is one half, so that the highest variance
occurs when

$$\epsilon_y(t) = \frac{1}{2} \sum_{j=-N}^{N} \sum_{i=1}^{M} \pm e_i(t-j).$$

Such a situation can happen when, at the zero lag $j = 0$, four weights are 1/2,
two -1/2, and, at non-zero lags $j \neq 0$, three weights are 1/2, three -1/2.
The probability density function for the most favorable case, i.e., when $y(t) =
\bar{x}(t)$, was plotted in Figure III-3. When, in the most unfavorable case conceivable,
all 186 filter weights are of magnitude one half, the probability
distribution for the random variable

$$\frac{1}{2} \sum_{j=-N}^{N} \sum_{i=1}^{M} e_i(t-j)$$

is accurately approximated by a normal distribution, so that the probability
density function for the error in $y(t)$ before roundoff is

$$p\left[\epsilon_y(t)\right] = \frac{1}{\sigma_\epsilon \sqrt{2\pi}}\, e^{-\epsilon_y^2(t) / 2\sigma_\epsilon^2},$$

where $\sigma_\epsilon^2$ is the sum of the 186 identical variances for each of the random
variables $e_i(t-j)/2$. That is to say,

$$\sigma_\epsilon^2 = 46.5\ \mathrm{Var}\left[e_i(t-j)\right].$$

The resulting probability density functions for the three cases are graphed in
Figure III-18.

It is now possible to calculate the conditional probability that the sign
of $y(t)$ is inverted given the absolute value $|y(t)|$: it is

$$\int_{|y(t)| + 1/2}^{\infty} p\left[\epsilon_y(t)\right] d\epsilon_y(t)$$

for integer arithmetic,

$$\int_{|y(t)|}^{\infty} p\left[\epsilon_y(t)\right] d\epsilon_y(t)$$

for floating-point arithmetic. Similarly, the conditional probability that $y(t)$
rounds to zero is

$$\int_{|y(t)| - 1/2}^{|y(t)| + 1/2} p\left[\epsilon_y(t)\right] d\epsilon_y(t)$$

for integer arithmetic, zero for floating-point arithmetic. The conditional
probability that the sign of $y(t)$ is reversed or that $y(t)$ rounds to zero is
displayed in Figure III-19 in the most favorable case for unscaled data, data
scaled by 16, and floating-point data. The stippled areas designate the probability
that $y(t)$ rounds to zero, the striped areas the probability that $y(t)$
changes sign. The corresponding probabilities in the most unfavorable case
are depicted in Figure III-20.

FIGURE III-18

ADAPTIVE FILTER OUTPUT ERROR PROBABILITY DENSITY
PER COUNT BEFORE ROUNDOFF (MAXIMUM POSSIBLE VARIANCE)

(a) Unscaled Data; (b) Data Scaled by 16; (c) Floating-Point Data.
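The conditional probabilities above reduce to differences of cumulative normal values once the error before roundoff is taken as normal, as in the maximum-variance case. The sketch below is illustrative only: the per-component variance of 1/12 (a uniform rounding error of one count) is an assumption made here, not a value taken from the report, and the function names are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard cumulative normal distribution F(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def p_sign_inversion(abs_y, sigma_e, integer_arithmetic=True):
    """P(sign of y flips | |y|): the error must exceed |y| (+1/2 if y is rounded)."""
    lower = abs_y + 0.5 if integer_arithmetic else abs_y
    return 1.0 - normal_cdf(lower / sigma_e)


def p_round_to_zero(abs_y, sigma_e):
    """P(y rounds to zero | |y|) for integer arithmetic."""
    return normal_cdf((abs_y + 0.5) / sigma_e) - normal_cdf((abs_y - 0.5) / sigma_e)


# ASSUMPTION: each component error is uniform on (-1/2, 1/2), variance 1/12.
COMPONENT_VARIANCE = 1.0 / 12.0
# 186 weights of magnitude 1/2 give 186/4 = 46.5 component variances.
SIGMA_E = math.sqrt(46.5 * COMPONENT_VARIANCE)

for abs_y in (0.0, 1.0, 2.0, 4.0):
    print(abs_y,
          round(p_sign_inversion(abs_y, SIGMA_E), 4),
          round(p_round_to_zero(abs_y, SIGMA_E), 4))
```

With a different component variance (for example one that includes digitization error), only `SIGMA_E` changes; the two integrals keep the same limits.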

For integer arithmetic, the total probability that the sign of $y(t)$
changes is the integral

$$\int_0^{\infty} p\left[|y(t)|\right] \left\{ \int_{|y(t)| + 1/2}^{\infty} p\left[\epsilon_y(t)\right] d\epsilon_y(t) \right\} d|y(t)|$$

over the interval $(0, \infty)$ of the conditional probability of sign inversion given
$|y(t)|$ multiplied by the probability density for the absolute value $|y(t)|$. The
total probability that $y(t)$ rounds to zero is obtained by replacing the inner
integral by the conditional probability that $y(t)$ is zero after roundoff: the
interval of integration is from $|y(t)| - 1/2$ to $|y(t)| + 1/2$. For floating-point
arithmetic, similarly, the limits of integration change to $|y(t)|$ and
infinity when specifying the total probability of sign inversion. Figure III-21
illustrates the regions in the $[y(t), \epsilon_y(t)]$ plane corresponding to the total
probabilities stipulated. To calculate these integrals, a knowledge of the probability
distribution for the adaptive-filter output values is required. Unfortunately,
it is not available. However, the adaptive-filter RMS level is available
for a number of data samples. Under the assumption that $y(t)$ and the
error $\epsilon_y(t)$ before roundoff are normally-distributed, independent random
variables with zero mean, the total probabilities can be found. For integer
arithmetic, the total probability of sign inversion is


FIGURE III-19

PROBABILITY THAT y(t) CHANGES SIGN OR ROUNDS TO ZERO
GIVEN |y(t)| [MOST FAVORABLE CASE: $y(t) = \bar{x}(t)$]

(b) Data Scaled by 16; (c) Floating-Point Data.

FIGURE III-21

REGIONS IN THE $[y(t), \epsilon_y(t)]$ PLANE CORRESPONDING TO
SIGN REVERSAL AND ROUNDING TO ZERO

(a) Integer Arithmetic; (b) Floating-Point Arithmetic.
Annotation: y(t) rounds to zero, filter does not move.


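$$2 \int_0^{\infty} \left\{ \int_{y + 1/2}^{\infty} \frac{1}{\sigma_\epsilon \sqrt{2\pi}}\, e^{-\epsilon^2 / 2\sigma_\epsilon^2}\, d\epsilon \right\} \frac{1}{\sigma_y \sqrt{2\pi}}\, e^{-y^2 / 2\sigma_y^2}\, dy
= 2 \int_0^{\infty} \left[ 1 - F\!\left( \frac{y + 1/2}{\sigma_\epsilon} \right) \right] \frac{e^{-y^2 / 2\sigma_y^2}}{\sigma_y \sqrt{2\pi}}\, dy,$$

where $\sigma_y$ is the adaptive-filter output RMS level and $F(z)$ is the standard
cumulative normal distribution function

$$F(z) = \int_{-\infty}^{z} \frac{1}{\sqrt{2\pi}}\, e^{-\zeta^2 / 2}\, d\zeta.$$

The probability of rounding to zero is

$$F\!\left( \frac{1}{2\sqrt{\sigma_y^2 + \sigma_\epsilon^2}} \right) - F\!\left( \frac{-1}{2\sqrt{\sigma_y^2 + \sigma_\epsilon^2}} \right).$$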

For floating-point arithmetic, the probability that the sign of $y(t)$ changes is
equal to the probability that the random variable

$$w = \frac{y(t)}{\epsilon_y(t)}$$

lies within the interval $-1 < w < 0$. The random variable $w$ has a Cauchy
distribution with probability density function

$$p(w) = \frac{\rho}{\pi (1 + \rho^2 w^2)},$$

where $\rho$ is the ratio $\sigma_\epsilon / \sigma_y$ of the standard deviation of $\epsilon_y(t)$ to the RMS
level of the adaptive filter output $y(t)$. The probability of a difference in sign
between $y(t)$ and $y(t) + \epsilon_y(t)$, therefore, is

$$\frac{\rho}{\pi} \int_{-1}^{0} \frac{dw}{1 + \rho^2 w^2} = \frac{\tan^{-1}\rho}{\pi}.$$

The arctangent is the angle from the $(\epsilon_y / \sigma_\epsilon)$-axis to the line $\rho\,(\epsilon_y / \sigma_\epsilon) =
-(y / \sigma_y)$ after the probability density is circularized by a coordinate transformation
which divides each random variable by its standard deviation. This
last estimate is useful even in integer arithmetic situations where the error
$\epsilon_y(t)$ before roundoff has a standard deviation much larger than one half
(e.g., Figure III-20b). Table III-4 lists the probability of a sign change or
zero value in the filter output from a 270° adaptive beam (with $K_s = 0.005$)
for a four-hour noise sample from day 238 of 1970 under the assumption that
all filter weights are of magnitude one half. These probabilities are uncomfortably
high. They indicate that sign changes and zero values may significantly
limit adaptive-filter performance. Even in the case of floating-point
data, digitization error results in a surprisingly high probability of a filter
update vector $A^{new} - A^{old}$ pointing in the wrong direction. The gain doubling
performed on the ALPA seismometers in July and August of 1972 should
reduce this probability by a factor of approximately two.
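As a check on the closed form, the sketch below compares $\tan^{-1}\rho / \pi$ with a direct midpoint-rule integration of the Cauchy density over $-1 < w < 0$, and shows why halving $\rho$ (as a gain doubling would, if the error level in counts is unchanged) roughly halves the probability when $\rho$ is small. The function names and the sample values of $\rho$ are hypothetical.

```python
import math


def p_sign_change_float(rho):
    """Closed-form probability that y and y + error differ in sign."""
    return math.atan(rho) / math.pi


def p_sign_change_numeric(rho, steps=100000):
    """Midpoint-rule integral of rho / (pi * (1 + rho^2 w^2)) over -1 < w < 0."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        w = -1.0 + (k + 0.5) * h
        total += rho / (math.pi * (1.0 + rho * rho * w * w))
    return total * h


for rho in (0.05, 0.1, 0.5, 1.0):
    print(rho, p_sign_change_float(rho), p_sign_change_numeric(rho))

# For small rho, arctan(rho) is close to rho, so halving rho nearly halves P.
print(p_sign_change_float(0.1) / p_sign_change_float(0.05))
```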

These appreciable errors in $y(t)$, particularly in the case of unscaled
data, may account for the radically different noise reduction values at data
scale factors of 1 and 16, respectively, for the quiet summer noise sample.

At the convergence rate $K_s = 0.30$, the adaptive filter takes advantage of
transient correlation between successive data vectors to reduce the filter output
power. In order to do so effectively, the adaptive filter must react rapidly
and accurately as each new data vector $X$ is received. If the filter step
$A^{new} - A^{old}$ pointed in the wrong direction a significant fraction of the time,
the drop in noise reduction from 6 dB to 2 dB when a data scale factor of 1
was used instead of 16 could easily be explained. In the event that the adaptive
filter output $y(t)$ rounded to zero, the effect would be somewhat less
severe: the filter vector would not update and could not react to incoming
data until the next data vector was available. Each time no update was performed,
some of the ability to exploit the transient correlation between consecutive
data vectors would be lost.

MAXIMUM PROBABILITY OF SIGN INVERSION OR ROUNDING TO ZERO IN ADAPTIVE

Without knowledge of the crosscorrelation statistics between input
data channels, the unlikely possibility that all filter weights are of magnitude
one half cannot be completely excluded. Part 2 of this subsection, as a by-product
of other investigations, gives the filter weights obtained before and
after the error vector $E_c$ is added to the data vector $X$. For specific crosscorrelation
matrices, the absolute values of the resulting filter weights could
be used to provide more accurate estimates of the error $\epsilon_y(t)$ before roundoff
in the adaptive filter output.

2.  Change in the Mean Square Filter Output Power $\overline{y^2(t)}$ Due to
Adding the Error Vector $E_c$ to the Data Vector $X$

If the error vector $E_c$ is added to the data vector $X$ and the roundoff
error $\epsilon_r(t)$ after forming the dot product $(X + E_c)^T A$ is incorporated
into the adaptive-filter output, a maximum-likelihood filter set can be designed
to minimize the total output power subject to the maximum-likelihood constraints
and an additional special constraint which reflects the fact that the
term $\epsilon_r(t)$ is preserved with unity response. The adaptive-filter output
with roundoff error is

$$y'(t) = (Q')^T A' = \left[ (X + E_c)^T \;\; \epsilon_r(t) \right] \begin{bmatrix} A \\ b \end{bmatrix} = \left[ Q^T \;\; \epsilon_r(t) \right] \begin{bmatrix} A \\ b \end{bmatrix},$$

where $Q'$ has $1 + M(2N+1)$ components, $A$ is subject to the normal maximum-likelihood
constraints and the last component $b$ of the expanded filter vector
$A'$ is subject to the special constraint $b = 1$. The constraints are expressed
in the matrix equation


$$\begin{bmatrix}
[1 \cdots 1] & [0 \cdots 0] & \cdots & [0 \cdots 0] & 0 \\
[0 \cdots 0] & [1 \cdots 1] & \cdots & [0 \cdots 0] & 0 \\
\vdots & & \ddots & & \vdots \\
[0 \cdots 0] & [0 \cdots 0] & \cdots & [1 \cdots 1] & 0 \\
[0 \cdots 0] & [0 \cdots 0] & \cdots & [0 \cdots 0] & 1
\end{bmatrix}
\begin{bmatrix} A \\ b \end{bmatrix}
=
\begin{bmatrix} D \\ 1 \end{bmatrix},$$

where $D$ is the $(2N+1)$-dimensional vector

$$D = \left[ d(-N), \ldots, d(-1), d(0), d(1), \ldots, d(N) \right]^T$$

and each of the row subvectors $[1 \cdots 1]$ or $[0 \cdots 0]$ in the constraint
matrix has $M$ components (where $M$ is the number of channels). In matrix
form, it may be written

$$C'A' = D',$$

where the ordinary maximum-likelihood constraint matrix $C$ is formed by
dropping the last row and column from $C'$, and where the expanded response
vector $D'$ is

$$D' = \begin{bmatrix} D \\ 1 \end{bmatrix}.$$


$C'$ is a $(2N+2)$ by $[1 + M(2N+1)]$ matrix, $A'$ a $[1 + M(2N+1)]$-dimensional
column vector, and $D'$ a $(2N+2)$-dimensional column vector reflecting
the white frequency response of the filter $A$
and the unalterable response of one on the error $\epsilon_r(t)$ generated by rounding
the adaptive filter output to the nearest integer. The adaptive-filter update
equation uses the method of steepest descent to reduce the mean square
output power $\overline{[y'(t)]^2}$ using the Widrow approximation $\overline{[y'(t)]^2} \approx [y'(t)]^2$
(Widrow, 1966):

$$(A')^{new} = A' - \mu \nabla \left\{ [y'(t)]^2 - 2\, (\Lambda')^T (D' - C'A') \right\}
= A' - 2\mu \left[ y'(t)\, Q' + (C')^T \Lambda' \right],$$

where $\Lambda'$ is the Lagrangian multiplier vector

$$\Lambda' = \left[ \lambda(-N), \ldots, \lambda(-1), \lambda(0), \lambda(1), \ldots, \lambda(N) \right]^T.$$

The Lagrangian multiplier vector is found by solving the vector equation

$$D' = C'(A')^{new} = C'A' - 2\mu \left[ y'(t)\, C'Q' + C'(C')^T \Lambda' \right]:$$

$$\Lambda' = \frac{1}{2\mu} \left[ C'(C')^T \right]^{-1} \left[ C'A' - D' \right] - y'(t) \left[ C'(C')^T \right]^{-1} C'Q'
= -\, y'(t) \left[ C'(C')^T \right]^{-1} C'Q'.$$

Substitution of $\Lambda'$ into the update equation yields

$$(A')^{new} = A' - 2\mu\, y'(t) \left\{ I - (C')^T \left[ C'(C')^T \right]^{-1} C' \right\} Q'.$$

The matrix $C'(C')^T$ is the $(2N+2)$ by $(2N+2)$ diagonal matrix

$$C'(C')^T = \begin{bmatrix}
M & 0 & \cdots & 0 & 0 \\
0 & M & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & M & 0 \\
0 & 0 & \cdots & 0 & 1
\end{bmatrix}$$

and the matrix $(C')^T \left[ C'(C')^T \right]^{-1} C'$ is the $[1 + M(2N+1)]$ by
$[1 + M(2N+1)]$ matrix

$$(C')^T \left[ C'(C')^T \right]^{-1} C' = \begin{bmatrix}
\tfrac{1}{M} J_M & 0 & \cdots & 0 & 0 \\
0 & \tfrac{1}{M} J_M & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & \tfrac{1}{M} J_M & 0 \\
[0 \cdots 0] & [0 \cdots 0] & \cdots & [0 \cdots 0] & 1
\end{bmatrix},$$

where each of the $2N+1$ submatrices $J_M$ is an $M$ by $M$ matrix with each element
equal to 1. The update equation reduces to

$$(A')^{new} = A' + 2\mu\, y'(t)\, (\bar{Q}' - Q'),$$

where $\bar{Q}' = (C')^T \left[ C'(C')^T \right]^{-1} C'Q'$, or the equivalent equation

$$\begin{bmatrix} A^{new} \\ 1 \end{bmatrix} = \begin{bmatrix} A \\ 1 \end{bmatrix} + 2\mu\, y'(t) \begin{bmatrix} \bar{Q} - Q \\ 0 \end{bmatrix},$$

where $\bar{Q} = C^T \left[ CC^T \right]^{-1} CQ$. This equation, which is the equation for
minimizing the total output power $\overline{[y'(t)]^2}$, is precisely the equation which
is implemented when the error vector $E_c$ is added to the data vector $X$ if
no roundoff error occurs when the beamsteer output components

$$\bar{q}(t-j) = \frac{1}{M} \sum_{i=1}^{M} q_i(t-j)$$

are rounded to the nearest integer:

$$A^{new} = A + 2\mu\, (Q')^T A'\, (\bar{Q} - Q)
= A + 2\mu \left[ (X + E_c)^T A + \epsilon_r(t) \right] \left[ \overline{(X + E_c)} - (X + E_c) \right].$$

As was shown in Subsection C, the beamsteer roundoff error vector $E_b$ can
be eliminated by scaling the vector $\bar{Q}$ by the number of channels and dividing
the convergence factor $\mu$ by $M$:

$$M \bar{q}(t-j) = \sum_{i=1}^{M} q_i(t-j),$$

so that $M\bar{q}(t-j)$ is always an integer and no roundoff error occurs. By
choosing $\mu$ small enough, the adaptive filter set can be shown to converge
to the optimum maximum-likelihood filter set if certain conditions are met
(Daniell, 1968).
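One step of the reduced update $A^{new} = A + 2\mu\, y\, (\bar{Q} - Q)$ can be sketched as follows, ignoring roundoff. The data layout (a list of lags, each holding one value per channel), the three-channel, three-lag example, and the numerical values are assumptions made for illustration. The sketch shows that the per-lag weight sums, which are the constraint values, are left unchanged, and that the output on the same data vector shrinks for a small enough $\mu$.

```python
def constrained_update(weights, data, mu):
    """One step A_new = A + 2*mu*y*(Qbar - Q); weights and data are indexed [lag][channel].

    Returns the new weights and the filter output y computed with the old weights.
    """
    y = sum(a * q for wl, dl in zip(weights, data) for a, q in zip(wl, dl))
    new_weights = []
    for wl, dl in zip(weights, data):
        qbar = sum(dl) / len(dl)          # beamsteer value at this lag
        new_weights.append([a + 2.0 * mu * y * (qbar - q) for a, q in zip(wl, dl)])
    return new_weights, y


# Hypothetical example: M = 3 channels, lags j = -1, 0, 1, starting from the beamsteer filter.
WEIGHTS = [[0.0, 0.0, 0.0], [1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0], [0.0, 0.0, 0.0]]
DATA = [[1.0, 2.0, 4.0], [3.0, -1.0, 2.0], [0.0, 5.0, -2.0]]
MU = 0.005

new_weights, y_before = constrained_update(WEIGHTS, DATA, MU)
_, y_after = constrained_update(new_weights, DATA, MU)

print("per-lag weight sums:", [round(sum(wl), 12) for wl in new_weights])
print("output before and after:", y_before, y_after)
```

Because $\bar{Q} - Q$ sums to zero at every lag, the update never leaves the constraint space in exact arithmetic; the departures discussed in the text arise only when $\bar{q}(t-j)$ or the weights are rounded.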

This relationship between the adaptive filter set and the optimum
maximum-likelihood filter set can be used to calculate the mean square filter
output power $\overline{[y'(t)]^2}$ with digitization and roundoff error. It can be
compared with the mean square filter output power $\overline{y^2(t)}$ in the absence of
such errors. The adaptive filter output with these errors is

$$y'(t) = y(t) + \epsilon_y(t) + \epsilon_r(t),$$

where $\epsilon_y(t)$ is the error before roundoff and $\epsilon_r(t)$ is the roundoff error
occurring when $y(t) + \epsilon_y(t)$ is rounded to the nearest integer. The quantity
$y(t) + \epsilon_y(t)$ may be broken up into the integer

$$[y(t) + \epsilon_y(t)]_I$$

obtained after roundoff and the fractional term

$$[y(t) + \epsilon_y(t)]_F$$

which is eliminated by the addition of $\epsilon_r(t)$. The mean square noise power
$\overline{[y'(t)]^2}$ is

$$\overline{[y'(t)]^2} = \overline{[y(t) + \epsilon_y(t)]^2} + 2\,\overline{[y(t) + \epsilon_y(t)]_I\, \epsilon_r(t)}
+ 2\,\overline{[y(t) + \epsilon_y(t)]_F\, \epsilon_r(t)} + \overline{\epsilon_r^2(t)}.$$

When the standard deviation of $y(t) + \epsilon_y(t)$ is larger than one count, its
rounded value $[y(t) + \epsilon_y(t)]_I$ has only negligible correlation with the roundoff
error $\epsilon_r(t)$, so that the term

$$2\,\overline{[y(t) + \epsilon_y(t)]_I\, \epsilon_r(t)}$$

can be neglected. On the other hand,

$$[y(t) + \epsilon_y(t)]_F + \epsilon_r(t) = 0,$$

so that

$$\overline{[y'(t)]^2} \approx \overline{[y(t) + \epsilon_y(t)]^2} - 2\,\overline{\epsilon_r^2(t)} + \overline{\epsilon_r^2(t)}$$

$$= \overline{[y(t) + \epsilon_y(t)]^2} - \overline{\epsilon_r^2(t)}$$

$$= A^T\, \overline{(X + E_c)(X + E_c)^T}\, A - \overline{\epsilon_r^2(t)}$$

$$= A^T \Phi A - \overline{\epsilon_r^2(t)},$$

where the matrix $\Phi$ is the crosscorrelation matrix corresponding to the
data vectors $X + E_c$. Minimizing the mean square filter output power $\overline{[y'(t)]^2}$
is the same as minimizing the term $A^T \Phi A$.

When the filter weights are subject to the maximum-likelihood constraints
$CA = D$, the optimum filter set is achieved when each component
of the gradient

$$\nabla \left[ \tfrac{1}{2} A^T \Phi A + \Lambda^T (D - CA) \right] = \Phi A - C^T \Lambda$$

is zero. The Lagrangian multiplier vector $\Lambda$ is found by using the constraint
conditions $CA = D$, so that the matrix equation

$$\begin{bmatrix} \Phi & C^T \\ C & 0 \end{bmatrix} \begin{bmatrix} A \\ -\Lambda \end{bmatrix} = \begin{bmatrix} 0 \\ D \end{bmatrix}$$

yields a solution for both the filter vector $A$ and the Lagrangian multiplier
vector $\Lambda$. By rearranging the rows and columns of this matrix equation,
the matrix can be put in block-Toeplitz form (as on page II-3). The filter
output power $A^T \Phi A$ is the zero-lag component $\lambda(0)$ of the Lagrangian
multiplier vector:

$$\Phi A = C^T \Lambda$$

$$A = \Phi^{-1} C^T \Lambda$$

$$CA = (C \Phi^{-1} C^T) \Lambda = D$$

$$\Lambda = (C \Phi^{-1} C^T)^{-1} D$$

$$A = \Phi^{-1} C^T (C \Phi^{-1} C^T)^{-1} D$$

$$A^T \Phi A = D^T (C \Phi^{-1} C^T)^{-1} C \Phi^{-1} \Phi \Phi^{-1} C^T (C \Phi^{-1} C^T)^{-1} D$$

$$= D^T (C \Phi^{-1} C^T)^{-1} D$$

$$= D^T \Lambda$$

$$= \lambda(0).$$

The vectors $A$ and $\Lambda$ as well as the mean square noise power $A^T \Phi A$ are
immediately available upon solution of the full matrix equation. The filter
output power with the addition of the error vector $E_c$ and the roundoff error
$\epsilon_r(t)$ is

$$D^T (C \Phi_{X+E}^{-1} C^T)^{-1} D - \overline{\epsilon_r^2(t)} = \lambda_{X+E}(0) - \overline{\epsilon_r^2(t)},$$

where the subscript $X+E$ denotes the addition of the error vector $E_c$ to the
data vector $X$. Without digitization error and roundoff error, the filter output
power is

$$D^T (C \Phi_X^{-1} C^T)^{-1} D = \lambda_X(0),$$

where the subscript $X$ denotes the data vector $X$ by itself. The ratio of the
altered mean square filter output power $\overline{[y'(t)]^2}$ to the original mean square
filter output power $\overline{y^2(t)}$ is

$$\frac{\lambda_{X+E}(0) - \overline{\epsilon_r^2(t)}}{\lambda_X(0)}.$$
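The chain of identities above can be exercised on the smallest possible case: two channels and a single lag, so that $C = [1\ 1]$ and $D = [1]$. The sketch below is not the report's computation. The matrix entries and the roundoff variance of 1/12 squared counts are hypothetical, and the error is modeled simply as a reduction of the two diagonal (autocorrelation) terms.

```python
def ml_filter_two_channel(phi):
    """Minimize A^T Phi A subject to a1 + a2 = 1 (C = [1 1], D = [1])."""
    (p11, p12), (p21, p22) = phi
    det = p11 * p22 - p12 * p21
    inv = [[p22 / det, -p12 / det], [-p21 / det, p11 / det]]
    u = [inv[0][0] + inv[0][1], inv[1][0] + inv[1][1]]   # Phi^-1 C^T
    lam = 1.0 / (u[0] + u[1])                            # (C Phi^-1 C^T)^-1 D
    a = [lam * u[0], lam * u[1]]                         # Phi^-1 C^T Lambda
    power = sum(a[i] * phi[i][j] * a[j] for i in range(2) for j in range(2))
    return a, lam, power


# Hypothetical error-free crosscorrelation matrix (squared counts).
PHI_X = [[4.0, 1.0], [1.0, 2.0]]
# ASSUMPTION: errors lower each autocorrelation term by 1/12 and leave cross terms alone.
EPS2 = 1.0 / 12.0
PHI_XE = [[4.0 - EPS2, 1.0], [1.0, 2.0 - EPS2]]

a_x, lam_x, power_x = ml_filter_two_channel(PHI_X)
a_xe, lam_xe, power_xe = ml_filter_two_channel(PHI_XE)
ratio = (lam_xe - EPS2) / lam_x

print("weights:", a_x, "lambda(0):", lam_x, "A^T Phi A:", power_x)
print("ratio of altered to original output power:", ratio)
```

In this toy case the output power equals the multiplier, as the derivation requires, and the ratio falls below one, in line with the later remark that reduced autocorrelation values normally lower the filter output power.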

Provided that the mean square error in predicting one channel from
any other channel at any time shift up to $N\Delta t$ is larger than 1 count$^2$, a
channel value rounded to the nearest integer (and the roundoff error on that
channel) should have negligible correlation with the roundoff error on any
other channel. As a result, crosscorrelation values between two different
channels should be unaffected by digitization and roundoff error. Autocorrelation
values, however, are affected. Under reasonably weak assumptions concerning
the lack of perfect correlation between triax components at each site,
between successive values of the unfiltered vertical component at each site,
and between successive values of the prefiltered vertical component at each
site, the autocorrelation values after allowing for digitization error and
roundoff error should be

$$\overline{x_i'(t)\, x_i'(t+\tau)} = \overline{x_i(t)\, x_i(t+\tau)} - \left[ \tfrac{1}{3}\, \overline{\epsilon_d^2} + \overline{\epsilon_v^2} \right] \phi_b(\tau) - \delta_{0\tau}\, \overline{\epsilon_x^2},$$

where the primes indicate values with error, where $\overline{\epsilon_d^2}$ is the variance of
the digitization error on each triax sensor, where $\overline{\epsilon_v^2}$ is the variance of
the error in rounding each unfiltered vertical component to the nearest integer,
where $\overline{\epsilon_x^2}$ is the variance of the error in rounding off each prefiltered vertical
component, where $\phi_b(\tau)$ is the autocorrelation function

$$\phi_b(r\Delta t) = \sum_{k=-L}^{L} b_k\, b_{k+r}$$

of the prefilter applied to the unfiltered vertical-component data, and where
$\delta_{0\tau}$ is the Kronecker delta operator (the variance $\overline{\epsilon_x^2}$ is subtracted only
when $\tau = 0$). Floating-point autocorrelation functions are affected only by
digitization error:

$$\overline{x_i'(t)\, x_i'(t+\tau)} = \overline{x_i(t)\, x_i(t+\tau)} - \tfrac{1}{3}\, \overline{\epsilon_d^2}\, \phi_b(\tau).$$

In the case of specific noise matrices, these results provide a
convenient technique for determining the change in the mean square noise
power $\overline{y^2(t)}$ from errors in the data vector $X$ and from roundoff error
after the dot product $(X + E_c)^T A$ has been formed. A knowledge of the
noise environment in which an adaptive-filtering system is to operate, together
with a computation of the digitization and roundoff error statistics
associated with a particular implementation scheme, is sufficient to set
specifications on the A/D system and on the precision of the data vector $X$
so as to come arbitrarily close to the mean square noise level $\lambda_X(0)$ for
data without digitization or roundoff error. (Further specifications on the
precision of the filter vector $A$ are, of course, also necessary.)


In one data sample with an enormous signal, it was necessary to reduce
the data scale factor from 16 to 4 to avoid clipping the signal. After
three runs were made at scale factors of 16, 8, and 4, the noise reduction
values over the section of data preceding the signal were sufficiently interesting
to make two additional runs at scale factors of 2 and 1. The noise
reduction results in Table III-6 are for the period 0430 to 0638 on day 7 of
1972. An adaptive filter was steered toward 263° at a convergence rate
$K_s = 0.006$. Decibel values in this table are relative to 1 count$^2$/second.
With the exception of the varying data scale factors, all processing parameters
are as described in Subsection A of Section V. Note that the beamsteer
output power increases by 6.181 dB, 6.098 dB, 6.061 dB, and 6.041 dB,
respectively, each time the data scale factor is doubled. Similarly, the
adaptive filter output power increases by 6.260 dB, 6.126 dB, 6.080 dB, and
6.051 dB. Without roundoff error, these figures should increase by $20 \log 2$
(6.021 dB). The actual results reflect the fact that roundoff error reduces
the mean square noise power by a relatively stable number of squared counts.
The only significance of these results is that zero output is obtained by scaling
the data down to the point where the largest value $x_i(t-j)$ in the data lies
between -1/2 and 1/2 before it is rounded off. The same effect occurs when
the crosscorrelation matrix is formed from data containing a signal. In fact,
signal power might be reduced more than noise power. Crosscorrelation
terms as well as autocorrelation terms in an ideal signal matrix are affected:

$$\overline{s_i'(t)\, s_k'(t+\tau)} = \overline{s_i'(t)\, s_i'(t+\tau)}
= \overline{s_i(t)\, s_i(t+\tau)} - \left[ \tfrac{1}{3}\, \overline{\epsilon_d^2} + \overline{\epsilon_v^2} \right] \phi_b(\tau) - \delta_{0\tau}\, \overline{\epsilon_x^2}.$$

As the data is scaled down to the point where the mean square error in predicting
one channel from another drops well below 1 count$^2$, the crosscorrelation
values need adjustment. And this condition is more readily achieved
when a signal is present.
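The pattern of increments slightly above 6.021 dB that shrink as the scale factor grows is what a fixed loss in squared counts would produce. The sketch below illustrates that reasoning with a simple model, measured power = (scale factor)$^2$ times an underlying power minus a constant loss. The model form, the underlying power of 2.0 squared counts, and the loss of 1/12 squared counts are assumptions chosen for illustration and are not fitted to Table III-6.

```python
import math


def doubling_gain_db(scale, true_power, loss):
    """dB increase in measured power when the data scale factor goes from scale to 2*scale."""
    before = scale ** 2 * true_power - loss
    after = (2 * scale) ** 2 * true_power - loss
    return 10.0 * math.log10(after / before)


IDEAL_DB = 20.0 * math.log10(2.0)      # increment expected without roundoff error

# Hypothetical values: power at scale factor 1 and a constant roundoff loss.
TRUE_POWER = 2.0
LOSS = 1.0 / 12.0

gains = [doubling_gain_db(s, TRUE_POWER, LOSS) for s in (1, 2, 4, 8)]

print("ideal increment (dB):", round(IDEAL_DB, 3))
print("modeled increments (dB):", [round(g, 3) for g in gains])
```

Each modeled increment exceeds the ideal value and the excess decays by roughly a factor of four per doubling, since the fixed loss becomes a smaller fraction of the scaled power.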

One important fact can be inferred from this study of the effects of
error in the data vector $X$. It is that the principal effect of roundoff error
and digitization error at very low convergence rates is simply to reduce the
autocorrelation function values in the matrix $\Phi = \overline{X X^T}$ when a significant
amount of power (in terms of computer counts) is uncorrelated between sensors.
Normally, the reduction of the autocorrelation functions causes a drop
in the filter output power. Aside from this scaling effect on signal and noise
alike, the error appears at very low convergence rates to have no major consequences
in terms of the mean square filter output once spatially uncorrelated
noise rises to certain minimal computer-count levels. This statement applies,
of course, only to situations where the convergence rate is slow enough that
the adaptive filter set closely approximates the optimum maximum-likelihood
filter set. It does not apply to the effect of digitization and roundoff error on
misadjustment, which is defined to be the fraction

$$\frac{\overline{y^2(t)} - \left[ \overline{y^2(t)} \right]_{optimum}}{\left[ \overline{y^2(t)} \right]_{optimum}}$$

of additional noise caused by the adaptive algorithm in a time-stationary environment.
The drop in noise reduction from 6 dB to 2 dB at $K_s = 0.30$
clearly illustrates this fact in the case of the quiet summer noise sample
where data scale factors of 16 and 1 were used.


An  appropriate  design  goal  would  be  to  reduce  the  difference  between 
the  mean  square  noise  output  values  with  and  without  error  to  the  maximum 
extent  feasible.  In  that  event,  at  least,  processing  results  would  increasingly 
reflect  instrument  measurements  rather  than  computational  noise. 


E.  DIRECTIONAL ERROR OF THE FILTER UPDATE VECTOR $(A^{new} - A^{old})$


In this subsection, the effects of rounding off the filter update vector are investigated. Part 1 discusses only the roundoff problems involved in the direct calculation of the update vector. After this calculation, the constraint conditions may no longer be satisfied. As has been pointed out in the literature (Frost, 1972), the cumulative effect of repeated calculations not satisfying the constraint conditions is to produce a random walk away from the constraint space. Some corrective procedure must be performed periodically to bring the filter set back into reasonable agreement with the constraint conditions. Part 2 considers the implications of various corrective methods on


the directional error of the filter update vector. Part 3 briefly describes the disadvantages of a procedure which eliminates filter-weight roundoff error and beamsteer roundoff error.

1.  Error Due to Rounding Off the Filter Weights $a_i(j)$

The average change in an individual component of the filter weight vector A is a critical quantity in evaluating the error due to rounding off the filter weights. An individual component may be immobilized or partially immobilized when the average change drops significantly below the quantization level of the filter weights. Even when the average change is approximately the same as the filter weight quantization level, the fact that some filter weight components $a_i(j)$ will be hampered more than others in their movement means that the filter vector A may be deflected from the desired direction of movement after several iterations.

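The squared magnitude $|A^{new} - A^{old}|^2$ of the filter update vector is

$$(A^{new} - A^{old})^T (A^{new} - A^{old}) = \frac{4K_s^2\, A^T X X^T A\, (\bar X - X)^T(\bar X - X)}{\left[(\bar X - X)^T(\bar X - X)\right]\left[(\bar X - X)^T(\bar X - X)\right]} = \frac{4K_s^2\, y^2(t)}{(\bar X - X)^T(\bar X - X)}$$

for the particular adaptive algorithm used in this report. With some mathematical manipulations, a clearer picture emerges of the factors influencing the filter step size:

$$\begin{aligned}
|A^{new} - A^{old}|^2 &= \frac{4K_s^2\, y^2(t)}{(\bar X - X)^T(\bar X - X)} \\
&= \frac{4K_s^2\, \psi^2(t)\, E[y^2(t)]}{E[(\bar X - X)^T(\bar X - X)]} \\
&= 4K_s^2\, \psi^2(t)\, \frac{E[\bar X^T \bar X]}{E[(\bar X - X)^T(\bar X - X)]} \cdot \frac{E[y^2(t)]}{E[\bar X^T \bar X]} \\
&= \frac{4K_s^2\, \psi^2(t)}{M(2N+1)\, \dfrac{E[\bar x^2(t)]}{E[y^2(t)]} \cdot \dfrac{E[X^T X] - E[\bar X^T \bar X]}{E[\bar X^T \bar X]}} \\
&= \frac{4K_s^2\, \psi^2(t)}{M(2N+1)\, R_A^2\, (R_B^2 - 1)}
\end{aligned}$$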


where the operator $E[\cdots]$ denotes the time average of the quantity inside the brackets, where $\psi(t)$ is the random variable

$$\psi(t) = \sqrt{\frac{y^2(t)}{E[y^2(t)]} \Bigg/ \frac{(\bar X - X)^T(\bar X - X)}{E[(\bar X - X)^T(\bar X - X)]}}\;,$$

where

$$R_A^2 = \frac{E[\bar x^2(t)]}{E[y^2(t)]}$$

is the beam output power reduction of adaptive filtering relative to beamsteering, and where

$$R_B^2 = \frac{E[X^T X]}{E[\bar X^T \bar X]}$$

is the power reduction of beamsteering relative to the average single-sensor power. $R_B^2$ is normally close to the value M (where M is the number of channels) provided that the noise field is spatially uncorrelated or coherent but distributed over a range of look directions not impinging on the main lobe of the time-shift-and-sum array beam pattern. Under the assumption that $R_B^2 = M$, the RMS change in one component of the filter weight vector is

$$\frac{|A^{new} - A^{old}|}{\sqrt{M(2N+1)}} = \frac{2K_s\, \psi(t)}{M(2N+1)\sqrt{M-1}\; R_A}\,.$$


It is the RMS change in the sense that it is the square root of the squared changes in the components $a_i(j)$, averaged over all components. The quantity in question still fluctuates as a function of time. If the time averages $E[\cdots]$ are taken over time intervals in which the power reduction of adaptive filtering relative to beamsteering is stable, the RMS change in a filter component is the product of the stable value

$$\frac{2K_s}{M(2N+1)\sqrt{M-1}\; R_A}$$

and the time-varying random variable $\psi(t)$. In view of the definition of $\psi(t)$,


it is likely that the mode point of the probability distribution for $\psi(t)$ is close to 1. As a result, the mode point of the probability distribution for the RMS change in one filter component $a_i(j)$ is close to

$$\frac{2K_s}{M(2N+1)\sqrt{M-1}\; R_A}\,.$$

At the convergence rate $K_s = 0.005$ and the amplitude reduction factor $R_A = 1.333$ (about 2.5 dB) for adaptive filtering relative to beamsteering, this quantity is 1/65536, or $2^{-16}$. The quantization level of the filter weights for the adaptive processor used in this report is $2^{-16}$. (A filter weight of one half is represented as 32768 counts.) Thus the RMS filter-weight change was close to one count when the adaptive filter output power was 2.5 dB below the beamsteer output power. And the actual noise reduction obtained at a convergence rate $K_s = 0.005$ was typically below 2.5 dB, so that the mode point of the probability distribution for the RMS filter-weight jump was probably within the range 1 to 1.333 counts.

If the probability density function for the random variable $\psi(t)$ were strongly peaked near a value of one, there would be considerable cause for alarm when the RMS change in one component of the filter weight vector dropped below 1/2 count; in that event, most of the components would be immobilized. A broadly dispersed probability distribution for $\psi(t)$, on the other hand, would permit movement when $\psi(t)$ reached its highest values; even if the rate might be biased either upward or downward, at least the filter vector would move. For this reason, the fluctuation of $\psi(t)$ as well as the RMS change in $a_i(j)$ warrants scrutiny. The random variable $\psi^2(t)$ may be expressed as the product


$$\psi^2(t) = \left\{\frac{y^2(t)}{E[y^2(t)]}\right\} \left\{\frac{E[(\bar X - X)^T(\bar X - X)]}{(\bar X - X)^T(\bar X - X)}\right\}.$$

Most of the fluctuation in $\psi^2(t)$ can be attributed to the term on the left: the filter output y(t) oscillates back and forth between positive and negative values, ordinarily reaching an amplitude greater than its RMS level. The term on the right, however, does not fluctuate nearly as much since it is a power average over 2N+1 consecutive points in time. The 31-second length of the adaptive filter used in this study spans 3 quarter-cycles of the 40-second period at which the bias-removal prefilter response (as well as the filtered data power spectrum) begins to roll off. Furthermore, power surges in $y^2(t)$ tend to compensate for power surges in $(\bar X - X)^T(\bar X - X)$. Since the point-to-point fluctuations in $y^2(t)$ have no counterpart in any other term affecting $\psi^2(t)$, it is reasonably safe to assume that $\psi^2(t)$ varies at least as much as $y^2(t)/E[y^2(t)]$. There are a number of probabilistic models that could conceivably describe the fluctuations in y(t). Just which one is most appropriate depends on the data. A reasonable assumption for the sake of discussion is that y(t) is normally distributed. In that event, it is easy to determine whether a "typical" filter-weight component is in danger of being immobilized by roundoff error when it is rounded to the nearest filter-weight count. By a "typical" component is meant a filter-weight component whose RMS change over time agrees with the RMS filter-weight change averaged across components. Table III-6 gives the probability of no movement and the standard deviation of the "typical" filter-weight component's motion after roundoff at four RMS filter-weight change values. In this table, only the variation in $\psi^2(t)$ due to the term $y^2(t)/E[y^2(t)]$ has been considered, and the mean of the movement in the "typical" filter-weight component has been ignored. Under these assumptions, it is apparent that the "typical" component is severely impeded in its movement only when the RMS filter-weight change drops below 1/4 count.


TABLE III-6

PROBABILITY OF NO MOVEMENT AND STANDARD DEVIATION OF THE
CHANGE IN A FILTER-WEIGHT COMPONENT WHOSE RMS CHANGE
OVER TIME IS THE SAME AS THE RMS FILTER-WEIGHT
CHANGE ACROSS COMPONENTS

RMS Filter-Weight     Probability of     Standard Deviation of the Change
Change (counts)       No Movement        in a Typical Component (counts)

     1.000               0.3830                     1.04
     0.500               0.6826                     0.57
     0.250               0.9546                     0.21
     0.125               1.0000                     0.00
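The entries of Table III-6 can be checked numerically. The Python sketch below assumes, as in the text, that y(t) is zero-mean and normally distributed, so that the unrounded change in a "typical" component is the RMS change times a standard normal variable, and that the change is rounded to the nearest count. The function names are illustrative and do not come from the report; the computed values should agree with the tabulated ones to within about one unit in the last printed digit.

```python
import math


def normal_cdf(z):
    """Cumulative distribution function of the standard normal."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def rounded_change_stats(rms_change, max_counts=200):
    """Return (probability of no movement, standard deviation in counts)
    for a change rms_change * z, z ~ N(0, 1), rounded to the nearest count."""
    # No movement: the unrounded change lies within half a count of zero.
    p_zero = normal_cdf(0.5 / rms_change) - normal_cdf(-0.5 / rms_change)
    # Second moment of the rounded change (its mean is zero by symmetry).
    second_moment = 0.0
    for k in range(1, max_counts + 1):
        p_k = (normal_cdf((k + 0.5) / rms_change)
               - normal_cdf((k - 0.5) / rms_change))
        second_moment += 2.0 * k * k * p_k  # counts +k and -k are equally likely
    return p_zero, math.sqrt(second_moment)


for rms in (1.0, 0.5, 0.25, 0.125):
    p_zero, sigma = rounded_change_stats(rms)
    print(f"{rms:5.3f}  {p_zero:6.4f}  {sigma:4.2f}")
```

Only the variation due to $y^2(t)/E[y^2(t)]$ is modeled here, matching the stated assumptions behind the table.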



In reality, a wide variation in average movement can be expected across the components of the filter vector A. It is well within the realm of possibility that the bulk of the motion could be concentrated in a small number of components. At an RMS filter-weight change of 1/4 count, therefore, it is likely that most of the filter weights are greatly restricted in their activity. As a result, a safety margin of several bits in the numerical representation of the filter weights seems appropriate. On the other hand, it is true that those components most critical to adaptive-filter performance do move at the 1/4-count RMS change value. One unambiguous result can be derived from this analysis: the filter vector proper is unlikely to move at all when the RMS filter-weight change goes below $1/[8\sqrt{M(2N+1)}]$ counts; at that level, Table III-6 implies that the largest change in any component rounds to zero even if all of the change is concentrated in a single component. The empirical results of Subsection IV-B indicate that noticeable degradation in filter performance begins to occur when the RMS change in an individual filter weight drops below 1/2 count.

Due to the fact that the error in rounding off a filter weight never changes the sign of the filter weight, it is possible to place some limits on the angular error in the vector $(A^{new} - A^{old})$. It is never more than 90° different from the vector $\operatorname{sgn}[y'(t)]\,(\bar X - X + E_b + E_c)$, where sgn is the function

$$\operatorname{sgn} z = \begin{cases} -1 & \text{if } z < 0 \\ \phantom{-}0 & \text{if } z = 0 \\ \phantom{-}1 & \text{if } z > 0, \end{cases}$$

where $y'(t)$ is the adaptive filter output after roundoff, and where $\bar X - X$, $E_b$, and $E_c$ are as defined previously. A slightly more useful bound can be found. Provided that the filter vector moves, the worst possible situation that can happen is that one component changes slightly more than 1/2 count while all other components change just slightly less than 1/2 count and round to zero. In that event, the angle between $(A^{new} - A^{old})$ after roundoff and $\operatorname{sgn}[y'(t)]\,(\bar X - X + E_b + E_c)$ is

$$\tan^{-1}\sqrt{M(2N+1) - 1}\,,$$

or 85°48' when M(2N+1) = 186.
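The worst-case angle can be verified directly from the geometry. The Python sketch below builds the limiting case described above (every component at 1/2 count before roundoff, a single unit component afterward) and measures the angle between the two vectors; the function name is illustrative only.

```python
import math


def worst_case_angle_deg(n_components):
    """Angle between an update whose components all sit at 1/2 count
    and its rounded version, in which only one component survives."""
    before = [0.5] * n_components                 # limiting case before roundoff
    after = [1.0] + [0.0] * (n_components - 1)    # one component rounds up to 1 count
    dot = sum(b * a for b, a in zip(before, after))
    norm_before = math.sqrt(sum(b * b for b in before))
    norm_after = math.sqrt(sum(a * a for a in after))
    return math.degrees(math.acos(dot / (norm_before * norm_after)))


angle = worst_case_angle_deg(186)
whole_degrees = int(angle)
minutes = (angle - whole_degrees) * 60.0
print(f"{whole_degrees} deg {minutes:.1f} min")
```

The cosine of the angle is $1/\sqrt{M(2N+1)}$, which is equivalent to the arctangent expression in the text.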

Up to this point, no accurate estimates of the angular error in the filter update vector $(A^{new} - A^{old})$ have been made because the change in the filter-weight components $a_i(j)$ before roundoff has been smaller than 1 count for at least some components in the situations examined. If all components change by at least several counts, simplifying assumptions which facilitate mathematical treatment are possible. In adaptive filtering systems where the process of rounding the filter weights affects the vector $(A^{new} - A^{old})$ only slightly, all or almost all of the filter-weight components do change by at least several counts. Hence it is worthwhile examining the results inferred from the simplifying assumptions.


The most easily derived result is an upper bound for the angle between the vector $\operatorname{sgn}[y'(t)]\,(\bar X - X + E_b + E_c)$ and the vector $(A^{new} - A^{old})$ after roundoff. Prior to rounding the filter update vector to the nearest filter-weight count, the filter update equation is

$$\frac{A^{new} - A^{old}}{q} = \frac{2K_s\, y'(t)\,(\bar X - X + E_b + E_c)}{q\,|\bar X - X + E_b + E_c|^2} = K(\bar X - X + E_b + E_c),$$

where q is the quantization level of the filter weights ($2^{-16}$ for the algorithm used in this report), and where K is the scalar constant

$$K = \frac{2K_s\, y'(t)}{q\,|\bar X - X + E_b + E_c|^2}\,.$$


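To the vector $(A^{new} - A^{old})/q$ is added the roundoff vector

$$D = \begin{bmatrix} \delta_1(-N) \\ \vdots \\ \delta_M(-N) \\ \vdots \\ \delta_1(0) \\ \vdots \\ \delta_M(0) \\ \vdots \\ \delta_1(N) \\ \vdots \\ \delta_M(N) \end{bmatrix}.$$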


The squared magnitude of the vector $(A^{new} - A^{old})/q$ is

$$\frac{4K_s^2\,[y'(t)]^2}{q^2\,|\bar X - X + E_b + E_c|^2}$$

in counts$^2$. The vector $(A^{new} - A^{old})/q$ after roundoff must lie within $|D|$ counts of $(A^{new} - A^{old})/q$ before roundoff. This situation is illustrated in




Figure III-22. The largest angle $\gamma$ between $K(\bar X - X + E_b + E_c) + D$ and $K(\bar X - X + E_b + E_c)$ occurs when D is perpendicular to $K(\bar X - X + E_b + E_c) + D$, so that

$$\sin^2\gamma = \frac{q^2\,|D|^2\,|\bar X - X + E_b + E_c|^2}{4K_s^2\,[y'(t)]^2}\,.$$

Since the largest possible absolute value of any component $\delta_i(j)$ in D is 1/2, $|D|^2_{max} = M(2N+1)/4$. Provided that the filter vector moves after roundoff, the minimum absolute value of $y'(t)$ is 1. Therefore

$$\sin^2\gamma \le \frac{46.5\, q^2\, |\bar X - X + E_b + E_c|^2_{max}}{4K_s^2}$$
for a 186-component filter vector. In a four-hour noise sample from day 238 of 1970, the largest value $|\bar X - X + E_b + E_c|^2$ observed was 18,617,020 counts$^2$ when the data was scaled by a factor of 16. Under the assumption that the largest value $|\bar X - X + E_b + E_c|^2$ for unscaled data was 1/256 as large, Table III-7 lists the maximum possible angle between the filter update vector before roundoff and the filter update vector after roundoff at the convergence rate $K_s = 0.005$. This table indicates, in the case of unscaled data, that a 24-bit filter vector guarantees an angle $\gamma$ considerably smaller than the average angle $\varphi$ between the vectors $(\bar X - X)$ and $(\bar X - X + E_b + E_c)$ at the convergence rate $K_s = 0.005$. Twenty-eight bits accomplish the same purpose for data scaled by 16. When the data is scaled by 16, of course, it is far less likely that the rounded adaptive filter output $y'(t)$ has an absolute value of 1. The angular bounds in Table III-7 are extremely conservative. Whenever

the maximum angle is below 85°48', the following conditions must occur simultaneously to achieve it: (1) the squared magnitude $|\bar X - X + E_b + E_c|^2$ must be at its maximum value; (2) the rectified filter output $|y'(t)|$ must be one; (3) the vector D must be perpendicular to the vector $K(\bar X - X + E_b + E_c) + D$; and (4) each component of D must be of magnitude 1/2.

FIGURE III-22

TWO-DIMENSIONAL PLANE PASSING THROUGH
THE ORIGIN, D, AND $K(\bar X - X + E_b + E_c)$

TABLE III-7

MAXIMUM POSSIBLE ANGLE OF DEFLECTION DUE TO ROUNDING
THE FILTER UPDATE VECTOR AT THE CONVERGENCE RATE
$K_s = 0.005$ (USING NOISE DATA FROM DAY 238 OF 1970)

        log2(1/q)
Unscaled Data    Data Scaled by 16    Maximum Angle

     16                 20               85°48'
     17                 21               85°48'
     18                 22               44°33'
     19                 23               20°32'
     20                 24               10° 6'
     21                 25                5° 2'
     22                 26                2°31'
     23                 27                1°15'
     24                 28                0°38'
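The bound behind Table III-7 can be evaluated with a few lines of Python. The sketch below is an illustration under stated assumptions, not the report's program: it uses the inequality for $\sin^2\gamma$ with $|y'(t)| = 1$, the quoted maximum of 18,617,020 counts$^2$ for data scaled by 16 (divided by 256 for unscaled data), and it caps the result at the worst-case angle $\tan^{-1}\sqrt{M(2N+1)-1}$ whenever the bound on $\sin^2\gamma$ exceeds one. All function and parameter names are hypothetical.

```python
import math


def max_deflection_deg(bits, norm_sq_max, k_s=0.005, n_components=186, y_min=1.0):
    """Upper bound (degrees) on the angle between the filter update vector
    before and after roundoff, for a quantization level q = 2**(-bits)."""
    q = 2.0 ** (-bits)
    # Worst case when the vector still moves: one component survives rounding.
    cap = math.degrees(math.atan(math.sqrt(n_components - 1)))
    d_sq_max = n_components / 4.0  # every roundoff component of magnitude 1/2
    sin_sq = d_sq_max * q * q * norm_sq_max / (4.0 * k_s ** 2 * y_min ** 2)
    if sin_sq >= 1.0:
        return cap
    return min(cap, math.degrees(math.asin(math.sqrt(sin_sq))))


def deg_min(angle_deg):
    """Split an angle into whole degrees and minutes (nearest minute)."""
    total_minutes = round(angle_deg * 60.0)
    return divmod(total_minutes, 60)


SCALED_NORM_SQ = 18_617_020.0          # counts^2, data scaled by 16
UNSCALED_NORM_SQ = SCALED_NORM_SQ / 256.0

for bits in range(16, 25):
    d, m = deg_min(max_deflection_deg(bits, UNSCALED_NORM_SQ))
    print(f"{bits:2d}  {bits + 4:2d}  {d:2d} deg {m:2d} min")
```

Because scaling the data by 16 multiplies the squared magnitude by 256, four additional bits are needed to obtain the same bound, which is why the two bit-count columns differ by 4.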

The vector $(A^{new} - A^{old})$ without any digitization or roundoff error points in the same direction as the vector $\operatorname{sgn}[y(t)]\,(\bar X - X)$. With the errors incorporated in the vectors $E_b$, $E_c$, and D, it points in the same direction as the vector $D + |K|\operatorname{sgn}[y'(t)]\,(\bar X - X + E_b + E_c)$. Since the filter-weight roundoff vector D is symmetrically distributed about the origin and is zero when $y'(t) = 0$, the specification of the probability distribution for the angular error in $(A^{new} - A^{old})$ reduces to the determination of the probability distribution for the angle between $|K|(\bar X - X)$ and $\operatorname{sgn}[y(t)]\operatorname{sgn}[y'(t)]\,[\,|K|(\bar X - X + E_b + E_c) + D\,]$. Probability distributions for $E_b$ and $E_c$ were derived previously in Subsection C. Likewise, the probability distribution for $\operatorname{sgn}[y(t)]\operatorname{sgn}[y'(t)]$ was discussed in Subsection D. As a first step in finding the probability distribution for the angle of error in $(A^{new} - A^{old})$, the filter-weight roundoff vector D needs to be examined.


Prior to rounding the filter weights at the j-th lag, an individual component of the filter vector is of the form

$$K\left\{\bar x(t-j) - x_i(t-j) + \epsilon_i(t-j) - \frac{1}{M}\sum_{k=1}^{M}\epsilon_k(t-j) + \epsilon_b(t-j)\right\}.$$

It consists of an integer part (the component rounded to the nearest integer) and a fractional part which is eliminated by the roundoff procedure. Thus, it is clear that the component $\delta_i(j)$ of the vector D is simply the negative of the fractional part of the expression above. Since the vectors $(\bar X - X)$ and $E_c$ satisfy the constraint conditions and since $\epsilon_b(t-j)$ is the same for all channels at the j-th lag,

$$\sum_{i=1}^{M} K\left\{\bar x(t-j) - x_i(t-j) + \epsilon_i(t-j) - \frac{1}{M}\sum_{k=1}^{M}\epsilon_k(t-j) + \epsilon_b(t-j)\right\} - KM\epsilon_b(t-j) = 0\,.$$

The fractional parts of the left-hand side of the equation must sum to an integer if the right-hand side is to equal zero:

$$-[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M}[-\delta_i(j)] = -I_j\,,$$

where $I_j$ is an integer and where the subscript F denotes the fractional part of $KM\epsilon_b(t-j)$. This equation specifies a constraint on the roundoff errors $\delta_i(j)$. Any one of the components is a function of the other components and $\epsilon_b(t-j)$. Let it be the M-th component:

$$\begin{aligned}
-\delta_M(j) &= [KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j) - I_j \\
&= \left\{[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j)\right\}_I + \left\{[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j)\right\}_F - I_j\,,
\end{aligned}$$

where the subscript I denotes the integer part of the quantity inside the braces. Since the negative of the M-th component of the roundoff error lies between -1/2 and 1/2, it is equal to the fractional part of the right-hand side of the equation:

$$-\delta_M(j) = \left\{[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j)\right\}_F.$$




Since $\delta_M(j)$ has no integer part, the integer part of the right-hand side must be zero, so that

$$I_j = \left\{[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j)\right\}_I.$$

Given probability distributions for $\epsilon_b(t-j)$ and the first M-1 components $\delta_i(j)$, the probability distribution of the integers $I_j$ can be ascertained. The probability of any integer $I_j$ is the probability that

$$I_j - 1/2 < [KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j) < I_j + 1/2\,.$$

Another useful way of expressing $I_j$ is

$$\begin{aligned}
I_j &= \left\{[KM\epsilon_b(t-j)]_F + [KM\epsilon_b(t-j)]_I + \sum_{i=1}^{M-1}\delta_i(j)\right\}_I - [KM\epsilon_b(t-j)]_I \\
&= \left\{KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j)\right\}_I - [KM\epsilon_b(t-j)]_I\,.
\end{aligned}$$

Let $L_j$ denote the integer part of $KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j)$. Its probability is the probability that

$$L_j - 1/2 < KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j) < L_j + 1/2\,.$$

An important random variable is the sum

$$\sum_{i=1}^{M}\delta_i(j) = I_j - [KM\epsilon_b(t-j)]_F = L_j - KM\epsilon_b(t-j)\,,$$

whose probability distribution depends on the probability of $KM\epsilon_b(t-j)$ and the conditional probability of $L_j$ given $KM\epsilon_b(t-j)$.


Let $z_i$ denote the filter-weight change

$$z_i = K\left\{\bar x(t-j) - x_i(t-j) + \epsilon_i(t-j) - \frac{1}{M}\sum_{k=1}^{M}\epsilon_k(t-j) + \epsilon_b(t-j)\right\}$$

of the i-th channel at lag j before roundoff. The roundoff error $\delta_i(j)$ depends only on the fractional part of $z_i$. Any integer change in $z_i$ results in the same roundoff error $\delta_i(j) = -(z_i)_F$, so that the roundoff errors $\delta_i(j)$ for any point in any (M-1)-dimensional cell

$$m_i - 1/2 < z_i < m_i + 1/2 \qquad (i = 1, 2, \ldots, M-1;\ m_i \text{ an integer})$$

are the same as the roundoff errors $\delta_i(j)$ for the corresponding point in the cell

$$-1/2 < z_i = (z_i)_F < 1/2 \qquad (i = 1, 2, \ldots, M-1),$$

which includes the origin $(z_1, z_2, \ldots, z_{M-1}) = (0, 0, \ldots, 0)$. Consequently, the sum

$$[KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j)$$


also remains invariant under integer translations of the variables $z_i$. Furthermore, the regions

$$I_j - 1/2 < [KM\epsilon_b(t-j)]_F + \sum_{i=1}^{M-1}\delta_i(j) < I_j + 1/2$$

corresponding to the integers $I_j$ are identical in each of the (M-1)-dimensional cells. Figure III-23 illustrates this situation when $[KM\epsilon_b(t-j)]_F = 0$ for the three-channel case. $I_j = -1$ in the upper right corner of each cell, and $I_j = 1$ in the lower left corner of each cell. Any point within the shaded regions defined by $-1/2 < (z_1)_F + (z_2)_F \le 1/2$ results in a value of zero for $I_j$. Non-zero values for $[KM\epsilon_b(t-j)]_F$ merely shift the boundaries for the integers $I_j$ within each cell by altering the constant in the boundary equations $(z_1)_F + (z_2)_F = \text{constant}$. Because of the constraints on the filter weights, the vectors $(z_1, z_2, \ldots, z_M)$ are concentrated in sheets whose $z_M$ components differ by the scalar constant K. The central sheet corresponding to $KM\epsilon_b(t-j) = 0$ passes through the origin. When each of the components $z_i$ is distributed over a sufficient number of filter-weight counts, the vectors $(z_1, z_2, \ldots, z_M)$ tend to become evenly distributed over the sheets within each of the (M-1)-dimensional cells

$$m_i - 1/2 < z_i < m_i + 1/2 \qquad (i = 1, 2, \ldots, M-1).$$

Accordingly, the vectors $(z_1, z_2, \ldots, z_M)$ projected onto the $(z_1, z_2, \ldots, z_{M-1})$-plane also tend to become evenly distributed within each (M-1)-dimensional cell. Thus the conditional probability distribution of the vectors $[(z_1)_F, (z_2)_F, \ldots, (z_{M-1})_F]$ within each cell tends to become the same for all cells. As a result, the roundoff error components $\delta_i(j)$ (i = 1, 2, ..., M-1) tend to be evenly distributed over the interval [-1/2, 1/2]. In the adaptive-filtering program used for this report, the changes $z_i$ before rounding each filter-weight component can assume values separated by only 2 filter-weight counts, so that the probability distribution for each of the first M-1 components $\delta_i(j)$ is adequately approximated by the probability density function

$$p[\delta_i(j)] = \begin{cases} 1 & |\delta_i(j)| < 1/2 \\ 0 & |\delta_i(j)| > 1/2 \end{cases}$$

whenever the filter-weight component changes are spread over a large number of filter-weight counts. The probability distribution for the beamsteer roundoff error $\epsilon_b(t-j)$ was specified in Subsection B (Figure III-4). The resultant distributions for $KM\epsilon_b(t-j)$ and $[KM\epsilon_b(t-j)]_F$ are easily obtained from it once the scalar constant K is given. It is now possible to determine the probability distributions associated with the filter-weight roundoff vector D.

FIGURE III-23

REGIONS IN THE $(z_1, z_2)$-PLANE SATISFYING THE INEQUALITIES
$I_j - 1/2 < \delta_1(j) + \delta_2(j) < I_j + 1/2$ (SHADED AREAS CORRESPOND
TO THE INTEGER $I_j = 0$)

Let $D_b$ denote the vector

$$D_b = \sum_{j=-N}^{N}\frac{1}{\sqrt{M}}\left[\sum_{i=1}^{M}\delta_i(j)\right]U_j\,,$$

where $U_j$ is the unit vector perpendicular to the constraint plane in the subspace corresponding to the j-th lag (see Subsection C, page III-25). Since this vector is the sum of individual vectors perpendicular to the constraint space, it is likewise perpendicular to the constraint space. Let $D_c$ be the vector $D_c = D - D_b$. It lies within the constraint space since, for each lag j,

$$\sum_{i=1}^{M}\left[\delta_i(j) - \frac{1}{M}\sum_{k=1}^{M}\delta_k(j)\right] = 0\,.$$

D therefore consists of a vector $D_c$ within the constraint space and a vector $D_b$ perpendicular to the constraint space.

The squared magnitude $|D_c|^2$ of the vector $D_c$ is

$$D_c^T D_c = \sum_{j=-N}^{N}\left\{\sum_{i=1}^{M}\delta_i^2(j) - \frac{1}{M}\left[L_j - KM\epsilon_b(t-j)\right]^2\right\}.$$

The random variable $\delta_M(j) = L_j - \left[KM\epsilon_b(t-j) + \sum_{k=1}^{M-1}\delta_k(j)\right]$ has the same probability distribution as the fractional part of the random variable $y = KM\epsilon_b(t-j) + \sum_{k=1}^{M-1}\delta_k(j)$. Since the random variables $\delta_k(j)$ (k = 1, 2, ..., M-1) and $\epsilon_b(t-j)$ can be assumed mutually independent, the probability density function for the random variable y can be expressed as the convolution of the probability density function for one of the M-1 roundoff error components, say $\delta_{M-1}(j)$, with the probability density function for the random variable $x = KM\epsilon_b(t-j) + \sum_{k=1}^{M-2}\delta_k(j)$:

$$p(y) = \int_{y-1/2}^{y+1/2} p(x)\,dx\,.$$

Since

$$\sum_{n=-\infty}^{\infty} p(y+n) = \int_{-\infty}^{\infty} p(x)\,dx = 1\,,$$

the probability density function for the fractional part of the random variable $y = KM\epsilon_b(t-j) + \sum_{k=1}^{M-1}\delta_k(j)$ is a uniform density between -1/2 and 1/2. As a result, each of the M roundoff error components $\delta_i(j)$ (i = 1, 2, ..., M) has a uniform density between -1/2 and 1/2, even though the random variables $\delta_i(j)$ (i = 1, 2, ..., M) are not mutually independent. The probability distribution for each roundoff error component $\delta_i(j)$ is the same for all values of K. Such is not the case for the discrete random variable $L_j - KM\epsilon_b(t-j)$.

A perspective drawing of its probability mass function as a function of K is shown in Figure III-24 for the six-channel case. Only the range K=0 to K=1/2 is shown since the probability mass function does not depend on the sign of K and repeats whenever K changes by an integer. Probability is concentrated in rays emanating from integer values $L_j$ along the line K=0. Each ray corresponds to one of the seven integer values which $M\epsilon_b(t-j)$ can assume. Broken lines indicate rays where $M\epsilon_b(t-j) = \pm M/2$. The maximum probability mass of 0.092 is attained everywhere along the solid line directly above $L_j - KM\epsilon_b(t-j) = 0$. At points in the $(K,\ L_j - KM\epsilon_b(t-j))$-plane where rays intersect, the total probability of the value $L_j - KM\epsilon_b(t-j)$ is the sum of the individual probability mass functions associated with each of the intersecting rays. The spikes at points of intersection are omitted in order to avoid complicating Figure III-24 unduly. The probability of $L_j - KM\epsilon_b(t-j)$ given a particular value $KM\epsilon_b(t-j)$ is the probability that

$$L_j - KM\epsilon_b(t-j) - 1/2 \le \sum_{i=1}^{M-1}\delta_i(j) < L_j - KM\epsilon_b(t-j) + 1/2\,.$$

This probability is the integral of the probability density function for $\sum_{i=1}^{M-1}\delta_i(j)$ over a one-count-wide interval centered at $L_j - KM\epsilon_b(t-j)$. The integral is equal to the convolution of a uniform probability density between -1/2 and 1/2 with the probability density function for the random variable $\sum_{i=1}^{M-1}\delta_i(j)$, where the convolution integral is evaluated at $L_j - KM\epsilon_b(t-j)$. Since each of the M-1 mutually independent random variables $\delta_i(j)$ is also assumed to have a uniform density between -1/2 and 1/2, the probability of $L_j - KM\epsilon_b(t-j)$ given $KM\epsilon_b(t-j)$ is the convolution of M uniform density functions (each equal to one over the interval -1/2 to 1/2), where the convolution is evaluated at $L_j - KM\epsilon_b(t-j)$. For this reason, the values $L_j - KM\epsilon_b(t-j)$ are confined to the interval (-M/2, M/2). To calculate the total probability of $L_j - KM\epsilon_b(t-j)$ at any point where it may occur, the M-fold convolution evaluated at $L_j - KM\epsilon_b(t-j)$ is multiplied by the sum of the probabilities of the values $M\epsilon_b(t-j)$ corresponding to the rays passing through the point in question. The probability of each value $M\epsilon_b(t-j)$ is 1/M unless $M\epsilon_b(t-j) = \pm M/2$, in which case it is 1/(2M). The probability of $L_j - KM\epsilon_b(t-j)$ is zero whenever, at any particular value of K, no ray passes through the value $L_j - KM\epsilon_b(t-j)$. The probability of $L_j - KM\epsilon_b(t-j)$ given any value $M\epsilon_b(t-j)$ lies on an envelope equal to the M-fold convolution of unit-width, uniform density functions. Thus the broken lines in Figure III-24 are located on an envelope half as high as the envelope containing the solid lines.
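The M-fold convolution of unit-width uniform densities has a standard closed form (the Irwin-Hall density, shifted here so that it is centered on zero). The Python sketch below evaluates it and multiplies by the probability of the ray, as described above; it treats a single ray only and ignores points where rays intersect. The function names are illustrative, not taken from the report.

```python
import math


def uniform_sum_pdf(s, n):
    """Density at s of the sum of n independent uniforms on [-1/2, 1/2],
    i.e. the n-fold convolution of unit-width uniform densities."""
    x = s + n / 2.0  # shift to the standard Irwin-Hall support [0, n]
    if x <= 0.0 or x >= n:
        return 0.0
    total = 0.0
    for k in range(int(math.floor(x)) + 1):
        total += (-1) ** k * math.comb(n, k) * (x - k) ** (n - 1)
    return total / math.factorial(n - 1)


def ray_probability(value, m_channels, on_broken_ray=False):
    """Probability mass of L_j - K*M*eps_b = value along a single ray.
    Rays with M*eps_b = +/- M/2 (broken lines) carry half the probability."""
    p_eps = 1.0 / (2 * m_channels) if on_broken_ray else 1.0 / m_channels
    return p_eps * uniform_sum_pdf(value, m_channels)


# Six-channel case, on a solid ray directly above L_j - K*M*eps_b = 0.
print(round(ray_probability(0.0, 6), 3))  # 0.092, the maximum quoted in the text
```

For six channels the centered six-fold convolution equals 0.55 at zero, and 0.55/6 rounds to the maximum probability mass of 0.092 cited for Figure III-24.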

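For the squared magnitude $|D_c|^2$ of the vector $D_c$, the expression for $|D_c|^2$ can be rewritten:

$$\begin{aligned}
D_c^T D_c &= \sum_{j=-N}^{N}\sum_{i=1}^{M}\left[\delta_i(j) - \frac{1}{M}\sum_{k=1}^{M}\delta_k(j)\right]^2 \\
&= \sum_{j=-N}^{N}\left\{\frac{M-1}{M}\sum_{i=1}^{M}\delta_i^2(j) - \frac{1}{M}\sum_{i=1}^{M}\sum_{\substack{k=1 \\ k\ne i}}^{M}\delta_i(j)\,\delta_k(j)\right\} \\
&= \frac{1}{M}\sum_{j=-N}^{N}\sum_{i=1}^{M-1}\sum_{k=i+1}^{M}\left[\delta_i^2(j) + \delta_k^2(j) - 2\,\delta_i(j)\,\delta_k(j)\right] \\
&= \frac{1}{M}\sum_{j=-N}^{N}\sum_{i=1}^{M-1}\sum_{k=i+1}^{M}\left[\delta_i(j) - \delta_k(j)\right]^2 .
\end{aligned}$$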

The mean of $|D_c|^2$ is $1/M$ times the sum of the means for the individual squared terms $[\delta_i(j) - \delta_k(j)]^2$. This fact does not depend on the mutual independence of the squared terms. Since $\delta_i(j)$ and $-\delta_k(j)$ have zero means, the mean of $[\delta_i(j) - \delta_k(j)]^2$ is the variance of the random variable $\delta_i(j) - \delta_k(j)$. Any subset of 2 to M-1 roundoff error components at the j-th lag contains mutually independent elements, so that

$$\mathrm{Var}[\delta_i(j) - \delta_k(j)] = \mathrm{Var}[\delta_i(j)] + \mathrm{Var}[\delta_k(j)]$$

whenever $M \ge 3$. For a uniform probability density between -1/2 and 1/2, the variance is 1/12, so that the mean of $|D_c|^2$ is

$$\overline{D_c^T D_c} = \frac{2N+1}{M}\cdot\frac{(M-1)M}{2}\cdot\frac{2}{12} = \frac{(M-1)(2N+1)}{12}$$

whenever $M \ge 3$. Unless $M = 2$, the mean of $|D_c|^2$ and the contribution to the mean from the subspace corresponding to the j-th lag do not depend on $K$. A simple corollary results from this fact. Since


the variance of the random variable $L_j - KM\epsilon_b(t-j)$ is $M/12$ (independent of $K$) whenever $M \ge 3$.
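The identity behind the mean of $|D_c|^2$ can be checked numerically. The sketch below is illustrative only: it assumes, for simplicity, that all M roundoff errors at a lag are independent and uniform on (-1/2, 1/2), whereas the text requires only that subsets of up to M-1 of them be independent. The function names are hypothetical.

```python
import random
from itertools import combinations

def dc_squared_direct(delta):
    """|D_c|^2 as the squared deviation of each lag's errors from their channel mean."""
    total = 0.0
    for row in delta:                      # one row of M roundoff errors per lag
        mean = sum(row) / len(row)
        total += sum((d - mean) ** 2 for d in row)
    return total

def dc_squared_pairwise(delta):
    """|D_c|^2 as (1/M) times the sum of squared pairwise differences at each lag."""
    total = 0.0
    for row in delta:
        total += sum((a - b) ** 2 for a, b in combinations(row, 2)) / len(row)
    return total

def expected_dc_squared(M, N):
    """Mean of |D_c|^2 from the text: (M-1)(2N+1)/12."""
    return (M - 1) * (2 * N + 1) / 12.0

M, N = 6, 15                               # 6 channels, 31 filter points
rng = random.Random(0)                     # fixed seed so the run is repeatable
trials = 2000
acc = 0.0
for _ in range(trials):
    # Simplifying assumption: fully independent uniform errors.
    delta = [[rng.uniform(-0.5, 0.5) for _ in range(M)] for _ in range(2 * N + 1)]
    acc += dc_squared_pairwise(delta)

print("Monte Carlo mean:", acc / trials)
print("Closed form     :", expected_dc_squared(M, N))
```

For 6 channels and 31 filter points the closed form evaluates to about 12.917, the constant that appears in the later expressions for the mean of $|D_c + KE_c|^2$.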

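The squared magnitude $|D_b|^2$ of the vector $D_b$ is

$$
\begin{aligned}
D_b^T D_b &= \frac{1}{M^2}\sum_{j=-N}^{N}\sum_{k=-N}^{N}\left[\sum_{i=1}^{M}\delta_i(j)\right]\left[\sum_{i=1}^{M}\delta_i(k)\right] U_j^T U_k \\
&= \frac{1}{M}\sum_{j=-N}^{N}\left[\sum_{i=1}^{M}\delta_i(j)\right]^2 \\
&= \frac{1}{M}\sum_{j=-N}^{N}\left[L_j - KM\epsilon_b(t-j)\right]^2 .
\end{aligned}
$$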

Its mean is equal to $(2N+1)/12$ whenever $M \ge 3$. The mean of the squared magnitude $|D|^2 = |D_b|^2 + |D_c|^2$ is $M(2N+1)/12$ for the same range of M values. The vector $D_b$, however, combines with the vector $KE_b$ in such a way that

$$|D_b + KE_b|^2 = \frac{1}{M}\sum_{j=-N}^{N}\left[L_j - KM\epsilon_b(t-j) + KM\epsilon_b(t-j)\right]^2 = \frac{1}{M}\sum_{j=-N}^{N} L_j^2 ,$$

where the probability of any integer $L_j$ is the probability that

$$L_j - 1/2 < KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j) \le L_j + 1/2 .$$


It depends solely on the probability distribution for the random variable $KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j)$, whose probability density function is pictured in Figure III-25 for the values K=0 to K=6. By the time K rises to the value 2.5, the fingers corresponding to each of the seven possible values of $\epsilon_b(t-j)$ begin to separate. Once K is 5.0 or above, they are completely distinct. The maximum probability density of 0.599 is attained at K=0, $KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j) = 0$. For values of K greater than 2.5, the maximum probability density is 0.100 on the inner five fingers, 0.050 on the outer two. The probability mass function for $L_j$ also varies with K. Since the probability of $L_j$ is the probability that the random variable $KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j)$ lies within 1/2 count of $L_j$, the probability contained within the one-count-wide lanes straddling integers in Figure III-25 condenses to form the probability mass function for $L_j$, which is plotted as a function of K in Figure III-26.

The maximum probability mass of 0.550 occurs at the origin. For values of K greater than 3.0, the maximum probability mass is 0.092 on the inner five fingers, 0.046 on the outer two. Note that the finger corresponding to $\epsilon_b(t-j) = 0$ in Figure III-25 necessarily results in a non-zero probability for the integers $L_j = -1$, $L_j = 0$, and $L_j = 1$ at all values of K in Figure III-26. This fact is important in part 2 of this subsection. The random variable $L_j$ is of interest in its own right. It represents the amount by which the error $\epsilon_b(t-j)$ and the errors $\delta_i(j)$ at the j-th lag shift the sum $\sum_{i=1}^{M} a_i(j)$ either toward or away from the j-th lag constraint condition. At the j-th lag, it completely specifies the probability distribution of the random walk away from the constraint condition.
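The peak values quoted above for the six-channel case can be reproduced from the distribution of a sum of M-1 = 5 independent uniform variables (the Irwin-Hall distribution). The following is a minimal sketch under that assumption; the function names are hypothetical and are not taken from the report's programs.

```python
from math import comb, factorial, floor

def irwin_hall_pdf(x, n):
    """Density at x of the sum of n independent Uniform(0, 1) variables."""
    if x < 0 or x > n:
        return 0.0
    s = sum((-1) ** k * comb(n, k) * (x - k) ** (n - 1) for k in range(floor(x) + 1))
    return s / factorial(n - 1)

def irwin_hall_cdf(x, n):
    """Distribution function at x of the sum of n independent Uniform(0, 1) variables."""
    if x <= 0:
        return 0.0
    if x >= n:
        return 1.0
    s = sum((-1) ** k * comb(n, k) * (x - k) ** n for k in range(floor(x) + 1))
    return s / factorial(n)

def centered_pdf(s, n):
    """Density at s of the sum of n independent Uniform(-1/2, 1/2) variables."""
    return irwin_hall_pdf(s + n / 2.0, n)

def centered_mass(L, n):
    """Probability that the centered sum lies within 1/2 count of the integer L."""
    return irwin_hall_cdf(L + 0.5 + n / 2.0, n) - irwin_hall_cdf(L - 0.5 + n / 2.0, n)

M = 6                                   # channels; M-1 roundoff errors enter the sum
peak_density = centered_pdf(0.0, M - 1)
peak_mass = centered_mass(0, M - 1)

print("K = 0 peak density        :", round(peak_density, 3))
print("separated fingers, density:", round(peak_density / M, 3), round(peak_density / (2 * M), 3))
print("K = 0 peak mass           :", round(peak_mass, 3))
print("separated fingers, mass   :", round(peak_mass / M, 3), round(peak_mass / (2 * M), 3))
```

Dividing the K=0 peaks by M and by 2M corresponds to weighting a fully separated finger by the probability 1/M or 1/2M of its value of $M\epsilon_b(t-j)$.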

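The mean of $|D_b + KE_b|^2$ is

$$
\begin{aligned}
\overline{|D_b + KE_b|^2} &= \frac{1}{M}\sum_{j=-N}^{N}\overline{\left\{\left[L_j - KM\epsilon_b(t-j)\right] + KM\epsilon_b(t-j)\right\}^2} \\
&= \frac{1}{M}\sum_{j=-N}^{N}\left\{\overline{\left[L_j - KM\epsilon_b(t-j)\right]^2} + 2\,\overline{\left[L_j - KM\epsilon_b(t-j)\right]KM\epsilon_b(t-j)} + \overline{\left[KM\epsilon_b(t-j)\right]^2}\right\} .
\end{aligned}
$$

Since M-1 of the roundoff error components $\delta_i(j)$ at the j-th lag are independent of $KM\epsilon_b(t-j)$,

$$
\begin{aligned}
2\,\overline{\left[L_j - KM\epsilon_b(t-j)\right]KM\epsilon_b(t-j)} &= 2\,\overline{\left[\sum_{i=1}^{M}\delta_i(j)\right]KM\epsilon_b(t-j)} \\
&= 2\,\overline{\delta_M(j)\,KM\epsilon_b(t-j)} \\
&= 2\,\overline{\left\{L_j - \left[KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j)\right]\right\}KM\epsilon_b(t-j)} .
\end{aligned}
$$

Whenever $M > 2$, the probability distribution for $\delta_M(j)$ is uniform between -1/2 and 1/2 and does not depend on the value of $KM\epsilon_b(t-j)$, so that

$$2\,\overline{\left[L_j - KM\epsilon_b(t-j)\right]KM\epsilon_b(t-j)} = 2\,\overline{\delta_M(j)\,KM\epsilon_b(t-j)} = 0$$

and

$$\overline{|D_b + KE_b|^2} = \frac{1}{M}\sum_{j=-N}^{N}\left\{\overline{\left[L_j - KM\epsilon_b(t-j)\right]^2} + \overline{\left[KM\epsilon_b(t-j)\right]^2}\right\} .$$

Whenever $M \ge 3$, the variance of the random variable $L_j - KM\epsilon_b(t-j)$ is $M/12$ and does not depend on K, so that

$$\overline{|D_b + KE_b|^2} = \frac{2N+1}{12} + K^2\sum_{j=-N}^{N} M\,\overline{\epsilon_b^2(t-j)} = \frac{2N+1}{12} + K^2\,\overline{|E_b|^2} .$$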


For the case of 6 channels and 31 filter points, the mean of $|E_b|^2$ is 16.361, and

$$\overline{|D_b + KE_b|^2} = 2.583 + 16.361\,K^2 .$$
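The two constants in this expression can be recovered from the probabilities stated earlier for the values of $M\epsilon_b(t-j)$ (1/M each, except 1/2M at $\pm M/2$). The sketch below assumes that $|E_b|^2 = M\sum_j \epsilon_b^2(t-j)$, which is inferred from the form of the result rather than stated in this passage; the function names are hypothetical.

```python
from fractions import Fraction

def beamsteer_error_variance(M):
    """Variance of eps_b when M*eps_b takes the integer values described in the text."""
    if M % 2:   # odd M: integers -(M-1)/2 .. (M-1)/2, each with probability 1/M
        pmf = {v: Fraction(1, M) for v in range(-(M - 1) // 2, (M - 1) // 2 + 1)}
    else:       # even M: +-M/2 carry 1/(2M), every other integer carries 1/M
        pmf = {v: Fraction(1, M) for v in range(-M // 2 + 1, M // 2)}
        pmf[-M // 2] = pmf[M // 2] = Fraction(1, 2 * M)
    assert sum(pmf.values()) == 1
    # The distribution is symmetric about zero, so the variance is the mean square.
    return sum(p * Fraction(v, M) ** 2 for v, p in pmf.items())

def mean_eb_squared(M, N):
    """Assumed form: mean of |E_b|^2 = M * (2N+1) * Var(eps_b)."""
    return M * (2 * N + 1) * beamsteer_error_variance(M)

def roundoff_term(N):
    """Mean of |D_b|^2 from the text: (2N+1)/12."""
    return Fraction(2 * N + 1, 12)

M, N = 6, 15
print("mean |D_b|^2 :", float(roundoff_term(N)))
print("mean |E_b|^2 :", float(mean_eb_squared(M, N)))
```

Exact rational arithmetic is used so that the printed values can be compared digit for digit with 2.583 and 16.361.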


In order to derive these results, it was presumed that the filter-weight change components $z_i$ were distributed over a sufficient number of filter-weight counts that the vectors $(z_1, \ldots, z_M)$ tended to become evenly distributed over sheets within each of the (M-1)-dimensional cells

$$m_i - 1/2 < z_i \le m_i + 1/2 \qquad (i = 1, 2, \ldots, M-1)$$

in the subspace corresponding to any one of the 2N+1 lags of the filter set. As a result of this premise, the roundoff error components $\delta_i(j)$ assumed probability distributions independent of both the prefiltered vertical-component data values $x_i(t-j)$ and errors $\epsilon_i(t-j)$. The lack of correlation between the prefiltered vertical-component errors $\epsilon_i(t-j)$ and filter-weight roundoff errors $\delta_i(j)$ implies that the mean of $|D_c + KE_c|^2$ is

$$\overline{|D_c + KE_c|^2} = \overline{|D_c|^2} + K^2\,\overline{|E_c|^2} = \frac{(M-1)(2N+1)}{12} + K^2\,\overline{|E_c|^2} .$$

This mean is $12.917 + 28.071\,K^2$ for unscaled data, $12.917 + 1081.952\,K^2$ for data scaled by 16. The mean of the squared magnitude of the error vector $D + KE$ is

$$
\begin{aligned}
\overline{|D_b + D_c + K(E_b + E_c)|^2} &= \overline{|D_b + KE_b|^2} + \overline{|D_c + KE_c|^2} \\
&= \overline{|D_b|^2} + \overline{|D_c|^2} + K^2\left(\overline{|E_b|^2} + \overline{|E_c|^2}\right) \\
&= \frac{M(2N+1)}{12} + K^2\,\overline{|E|^2}
\end{aligned}
$$

since both $D_b$ and $E_b$ are perpendicular to either $D_c$ or $E_c$. The resultant mean is $15.5 + 44.432\,K^2$ for unscaled data, $15.5 + 1098.313\,K^2$ for data scaled by 16.


In the same way that an estimate was obtained in Subsection C for the mean angle between the vector $(X-\bar X)$ and the vector $(X-\bar X) + E$, an estimate can be made for the mean $\bar\theta$ of the angle $\theta$ between the vectors $(X-\bar X)$ and $\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K)$:

$$\bar\theta \approx \tan^{-1}\sqrt{\frac{q^2\,\overline{|D_c|^2}\;E[|X-\bar X|^2]}{4K_s^2\,\overline{[y'(t)]^2}} + \frac{\overline{|E_c|^2}}{E[|X-\bar X|^2]}} .$$

In this estimate, the angle between $D_c + KE_c$ and $(X-\bar X)$ has been assumed to be 90°. In addition, the difference between $E[|X-\bar X + E_b + E_c|^2]$ and $E[|X-\bar X|^2]$ has been assumed negligible. Furthermore, the probability distributions for $|D_c|^2$, $|X-\bar X|^2$, $[y'(t)]^2$, and $|E_c|^2$ are presumed to be sharply peaked around their respective mean values. In view of all these approximations, the estimate here is not likely to be as accurate as the corresponding estimate in Subsection C. For the four-hour noise sample from day 238 of 1970 which has been used throughout this section, the mean of $|X-\bar X|^2$ was 9693 counts² for unscaled data, 2,481,422 counts² for data scaled by 16. When substitutions are made for values which do not depend on the convergence factor $K_s$, the estimated mean angle becomes

$$\bar\theta \approx \tan^{-1}\sqrt{\frac{(2^{-32})(12.917)(9693)}{4K_s^2\,\overline{[y'(t)]^2}} + \frac{\overline{|E_c|^2}}{9693}} = \tan^{-1}\sqrt{\frac{7.2878\times 10^{-6}}{K_s^2\,\overline{[y'(t)]^2}} + 2.8904\times 10^{-3}}$$

for unscaled data,

$$\bar\theta \approx \tan^{-1}\sqrt{\frac{7.2878\times 10^{-6}}{K_s^2\,\overline{[y'(t)]^2}} + 4.3602\times 10^{-4}}$$

for data scaled by 16. Here the mean adaptive filter output power $\overline{[y'(t)]^2}$ for data scaled by 16 is expressed in terms of the equivalent value corresponding to unscaled data. Table III-8 gives the adaptive filter output power for data scaled by 16 (after division by 256) as a function of the convergence factor $K_s$. The resulting estimated mean of the angle $\theta$ for unscaled data and data scaled by 16 appears in the two rightmost columns as a function of the convergence factor $K_s$. Figure III-27 graphs the estimated mean angles $\bar\theta$ of Table III-8 as a function of the convergence factor $K_s$ for unscaled data and data scaled by 16. When the adaptive filter output $y(t)$ without errors and the adaptive filter output $y'(t)$ with errors are non-zero and have the same sign, $\theta$ is the angle between the vector $(A^{new} - A^{old})$ without any digitization or roundoff error and the projection onto the constraint space of the vector $(A^{new} - A^{old})$ with digitization and roundoff error. The mean adaptive filter output power $\overline{[y'(t)]^2}$ was taken from computer runs where the data was scaled by 16. At higher convergence rates, the value $\overline{[y'(t)]^2}$ for unscaled data is likely to be larger than for scaled data. However, the term $\overline{|E_c|^2}/E[|X-\bar X|^2]$ predominates at high values of $K_s$. At lower convergence rates, the adaptive filter output power is likely to be approximately the same for unscaled data and data scaled by 16. Since no runs were made with unscaled data, the estimate of $\bar\theta$ in the top curve of Figure III-27 is the best possible without $\overline{[y'(t)]^2}$ measurements for unscaled data. The estimates of $\bar\theta$ in Figure III-27 are the angle $\theta$ in the constraint space before any attempt to reduce discrepancies between the filter weights $a_i(j)$ and the constraint conditions at each of the 2N+1 lags in the adaptive filter set. A discussion of the angle of error after corrective procedures designed to force reasonable agreement with the constraint conditions must be postponed until part 2 of this subsection.
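The angle estimate can be evaluated directly for any convergence factor and output power. The sketch below is an illustration rather than the program used for the report: it assumes a quantization level of $q = 2^{-16}$ (consistent with the 16-bit filter weights mentioned in part 2) and uses the mean values of $|D_c|^2$, $|E_c|^2$ and $|X-\bar X|^2$ quoted in this subsection. The names are hypothetical.

```python
from math import atan, degrees, sqrt

Q = 2.0 ** -16            # assumed filter-weight quantization level
DC_SQ = 12.917            # mean |D_c|^2 for 6 channels and 31 filter points
X_SQ_UNSCALED = 9693.0    # mean |X - Xbar|^2 in counts^2, unscaled data

# Second term of the estimate: mean |E_c|^2 divided by mean |X - Xbar|^2.
EC_RATIO = {
    "unscaled": 28.071 / 9693.0,
    "scaled": 1081.952 / 2481422.0,
}

def mean_angle_deg(ks, power, scaling):
    """Estimated mean angle in degrees.

    `power` is the mean adaptive-filter output power expressed in equivalent
    unscaled counts^2, so the first term is the same for both scalings.
    """
    first = Q ** 2 * DC_SQ * X_SQ_UNSCALED / (4.0 * ks ** 2 * power)
    return degrees(atan(sqrt(first + EC_RATIO[scaling])))

def deg_min(angle):
    """Format an angle in degrees as degrees and rounded arc minutes."""
    total_minutes = round(angle * 60)
    return f"{total_minutes // 60}° {total_minutes % 60}'"

for ks, power in [(0.003, 10.4431), (0.005, 9.7905), (0.50, 4.6254)]:
    print(ks, deg_min(mean_angle_deg(ks, power, "unscaled")),
          deg_min(mean_angle_deg(ks, power, "scaled")))
```

Because the second term does not depend on $K_s$, the estimate levels off at high convergence factors, which is the behavior of the two curves described for Figure III-27.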

The effect on the mean angle of error $\bar\theta$ due to a change in quantization level may be evaluated in terms of the approximation for $\bar\theta$. If the quantization level is halved by adding one additional bit to the filter-weight




TABLE III-8

ESTIMATED MEAN $\bar\theta$ OF THE ANGLE BETWEEN $(X-\bar X)$ AND
$\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K)$ AS A FUNCTION
OF THE CONVERGENCE FACTOR $K_s$ (USING NOISE
DATA FROM DAY 238 OF 1970)

Convergence     Adaptive Filter      Estimated Mean of the Angle $\theta$
Factor $K_s$    Power (counts²)      Unscaled Data     Data Scaled by 16

0.002           11.0875              22° 13'           22°  6'
0.003           10.4431              15° 50'           15° 36'
0.004           10.0444              12° 23'           12°  5'
0.005            9.7905              10° 15'            9° 52'
0.007            9.4075               7° 47'            7° 16'
0.010            9.0444               5° 58'            5° 16'
0.015            8.6332               4° 40'            3° 42'
0.02             8.3197               4°  4'            2° 55'
0.03             7.7843               3° 35'            2° 12'
0.05             6.9377               3° 18'            1° 41'
0.07             6.2887               3° 12'            1° 29'
0.10             5.6118               3°  9'            1° 22'
0.15             5.0118               3°  7'            1° 17'
0.20             4.7441               3°  6'            1° 15'
0.25             4.6290               3°  5'            1° 14'
0.30             4.5788               3°  5'            1° 13'
0.35             4.5733               3°  5'            1° 13'
0.40             4.5799               3°  5'            1° 13'
0.50             4.6254               3°  5'            1° 12'


FIGURE III-27

ESTIMATED MEAN $\bar\theta$ OF THE ANGLE BETWEEN $(X-\bar X)$ AND
$\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K)$ AS A FUNCTION
OF CONVERGENCE RATE (USING NOISE DATA
FROM DAY 238 OF 1970)

(Abscissa: convergence factor $K_s$.)


representation, then the same mean angle $\bar\theta$ is achieved at about half the original convergence factor if the change in the adaptive-filter output power can be ignored. For the data in Table III-8, the change in the adaptive-filter output power is near 10% when $K_s$ changes by a factor of 2, so that the gross effect of scaling the quantization level is to multiply the abscissa $K_s$ in Figure III-27 by an equivalent amount.


2.  Error in the Constraint Conditions

If a signal $s(t)$ from the look direction propagates across the array from which the data used in an adaptive-filtering process originates, the output of the adaptive filter is

$$\sum_{j=-N}^{N}\left[\sum_{i=1}^{M} a_i(j)\right] s(t-j) .$$

In effect, a filter with weights $\sum_{i=1}^{M} a_i(j)$ $(j = -N, \ldots, -1, 0, 1, \ldots, N)$ is applied to the signal. The constraint conditions

$$\sum_{i=1}^{M} a_i(j) = a(j)$$

determine the frequency response of the filter set. In the adaptive-filtering process implemented for this report, a white frequency response is specified by setting $a(j) = \delta_{j0}$ (where $\delta_{j0}$ is the Kronecker delta operator). The effect of not satisfying the constraint conditions is to alter the frequency response of the adaptive processor to a signal from the look direction.
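The role of the constraint conditions can be illustrated with a small numerical example. The sketch below uses made-up weights for three channels and three lags (they are not from the report) and a hypothetical helper that evaluates the frequency response of the channel-summed weights. When the sums equal the Kronecker delta the response magnitude is 1 at every frequency; a discrepancy at one lag makes it frequency dependent.

```python
import cmath

def look_direction_response(weights, freq):
    """Response to a look-direction signal at `freq` (cycles per sample).

    weights[i][k] holds a_i(j) for channel i and lag j = k - N.
    """
    n_lags = len(weights[0])
    N = (n_lags - 1) // 2
    # Only the sum over channels at each lag acts on a look-direction signal.
    summed = [sum(channel[k] for channel in weights) for k in range(n_lags)]
    return sum(a * cmath.exp(-2j * cmath.pi * freq * (k - N))
               for k, a in enumerate(summed))

# Illustrative weights whose channel sums are (0, 1, 0): constraint satisfied.
satisfied = [[0.2, 0.5, -0.1],
             [-0.3, 0.25, 0.4],
             [0.1, 0.25, -0.3]]

# Same weights with an error of 0.1 in the sum at lag j = -1.
violated = [row[:] for row in satisfied]
violated[0][0] += 0.1

for f in (0.0, 0.1, 0.25, 0.5):
    print(f,
          round(abs(look_direction_response(satisfied, f)), 6),
          round(abs(look_direction_response(violated, f)), 6))
```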

After a large number of iterations, a random walk away from the constraint space takes place in the absence of an error-correcting procedure. In the case of floating-point data, many iterations may elapse before the damage [illegible] or [illegible] may be executed at each iteration with a technique similar to that of O. L. Frost (Frost, 1972). With [illegible] arithmetic, the roundoff errors are [illegible] an order of magnitude greater than [illegible], so that frequent corrective action is necessary.

At each iteration, the probability distribution of the change in the sum $\sum_{i=1}^{M} a_i(j)$ for the j-th lag is a discrete integer distribution whose probability mass function was depicted in Figure III-26. The probability of each possible integer shift $L_j$ is the probability that

$$L_j - 1/2 < KM\epsilon_b(t-j) + \sum_{i=1}^{M-1}\delta_i(j) \le L_j + 1/2 .$$

On the first iteration, the squared distance from the constraint space (in squared filter-weight counts) becomes

$$|D_b + KE_b|^2 = \frac{1}{M}\sum_{j=-N}^{N} L_j^2 ,$$

whose contribution from the j-th lag is $L_j^2/M$. In order to bring the filter vector back to the constraint space along a line perpendicular to the constraint space, the quantity $L_j/M$ must be subtracted from each component of the filter vector. [illegible]

$$-(M-1) \le \delta_{j0} - \sum_{i=1}^{M} a_i(j) \le M-1 .$$

Let $J$ denote the discrepancy. In order to satisfy the constraint conditions with minimum movement inside the constraint plane, $\mathrm{sgn}(J)$ must be subtracted from $|J|$ of the M components at one lag, while $M - |J|$ components must be left alone. At any one lag, the squared magnitude of the resultant displacement within the constraint plane is

$$|J|\left(\frac{M - |J|}{M}\right)^2 + \left(M - |J|\right)\frac{J^2}{M^2} = \frac{|J|\left(M - |J|\right)}{M} .$$

For the integers such that $1 \le |J| < M/2$, the squared magnitude is greater than the previous squared magnitude $J^2/M$ for the vector perpendicular to the constraint plane. It is the same when $|J| = M/2$, and less when $M/2 < |J| < M$. This procedure was employed by the adaptive-filtering system which provides the results for this report. It converts error perpendicular to the constraint space into error within the constraint space. This feature is an annoying disadvantage of the procedure.
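The closed form for the in-plane displacement can be checked by building the correction vector explicitly and removing its component perpendicular to the constraint plane. This is a small verification sketch with hypothetical function names, not code from the report.

```python
def in_plane_displacement_sq(M, J):
    """Squared in-plane part of the step that subtracts sgn(J) from |J| of M components."""
    sign = (J > 0) - (J < 0)
    step = [-sign] * abs(J) + [0] * (M - abs(J))
    mean = sum(step) / M                  # equal shift of every component: perpendicular part
    return sum((s - mean) ** 2 for s in step)

def closed_form(M, J):
    """|J| (M - |J|) / M, the expression given in the text."""
    return abs(J) * (M - abs(J)) / M

M = 6
for J in range(-(M - 1), M):
    in_plane = in_plane_displacement_sq(M, J)
    perpendicular = J * J / M             # squared distance removed from the constraint error
    print(J, round(in_plane, 4), round(closed_form(M, J), 4), round(perpendicular, 4))
```

For M = 6 the printed rows show the in-plane term exceeding $J^2/M$ for $|J| = 1, 2$, matching it at $|J| = 3$, and falling below it for $|J| = 4, 5$.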

If $C$ is the error vector due to conversion of error perpendicular to the constraint space into error within the constraint space, the mean of $|C|^2$ is

$$\overline{|C|^2} = \frac{2N+1}{M}\;\overline{|J|\left(M - |J|\right)} .$$

If $|J|$ were always equal to $M/2$, $\overline{|C|^2}$ would reach the maximum value

$$M(2N+1)/4 .$$

It is difficult to envision a situation where the mean value would be equal to the maximum. However, if the integer shifts $L_j$ were distributed in such a way that the values $J = L_j$ (modulo M) were equally likely at the values $J = 0, 1, \ldots, M-1$, the mean of $|C|^2$ would be

$$\frac{2N+1}{M^2}\sum_{J=0}^{M-1} |J|\left(M - |J|\right) = \frac{(2N+1)(M^2 - 1)}{6M} .$$

Such a distribution for the integer shifts $L_j$ would occur if K were sufficiently high. The least possible value for $\overline{|C|^2}$ would occur at K=0, where $L_j = \pm 1$ about half the time and $L_j = 0$ about half the time (when M = 6). At K = 0,

$$\overline{|C|^2} = \frac{(2N+1)(1)(5)}{2M} = \frac{31 \cdot 5}{12} = 12.917 .$$
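The three cases for the mean of $|C|^2$ follow from the same formula with different probability distributions for $J$. The sketch below evaluates them exactly for 6 channels and 31 filter points; the distributions passed in are the idealized ones described in the text, and the function name is hypothetical.

```python
from fractions import Fraction

def mean_c_squared(M, N, pmf_J):
    """Mean of |C|^2 = ((2N+1)/M) * E[|J| (M - |J|)] for a given distribution of J."""
    return Fraction(2 * N + 1, M) * sum(p * abs(J) * (M - abs(J)) for J, p in pmf_J.items())

M, N = 6, 15

# K = 0 idealization from the text: |J| = 1 about half the time, J = 0 otherwise.
low = mean_c_squared(M, N, {0: Fraction(1, 2), 1: Fraction(1, 4), -1: Fraction(1, 4)})

# High-K idealization: J equally likely at 0, 1, ..., M-1.
uniform = mean_c_squared(M, N, {J: Fraction(1, M) for J in range(M)})

# Worst case: |J| always equal to M/2.
worst = mean_c_squared(M, N, {M // 2: Fraction(1)})

D_C_SQ = Fraction((M - 1) * (2 * N + 1), 12)   # mean |D_c|^2 from earlier in the subsection

for label, value in [("K = 0", low), ("uniform J", uniform), ("|J| = M/2", worst)]:
    print(label, round(float(value), 3), round(float(D_C_SQ + value), 3))
```

The second printed column is the sum $\overline{|D_c|^2} + \overline{|C|^2}$ that enters the angle estimate after the correction procedure.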


The mean angle $\bar\theta$ between the vector $(X-\bar X)$ and the vector $\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K + C/K)$ can be estimated as before:

$$\bar\theta \approx \tan^{-1}\sqrt{\frac{q^2\left(\overline{|D_c|^2} + \overline{|C|^2}\right)E[|X-\bar X|^2]}{4K_s^2\,\overline{[y'(t)]^2}} + \frac{\overline{|E_c|^2}}{E[|X-\bar X|^2]}} .$$

At K = 0, $\overline{|D_c|^2} + \overline{|C|^2} = 25.833$. When the values $J = L_j$ (modulo M) are equally likely at $0, 1, \ldots, M-1$, $\overline{|D_c|^2} + \overline{|C|^2} = 43.056$. If $|J|$ were always $M/2$, $\overline{|D_c|^2} + \overline{|C|^2}$ would be 59.417. The value 43.056 will be used. When $E[|X-\bar X|^2] = 9693$ counts², [equations illegible in the source] for data scaled by 16. Table III-9 lists these mean angles $\bar\theta$ for unscaled data and data scaled by 16 as a function of the convergence factor $K_s$. Figure III-28 is a graph of the values in Table III-9. If $y'(t)$ is a non-zero value with the same sign as $y(t)$, $\theta$ is the angle of error in $(A^{new} - A^{old})$ after the constraint-condition correction procedure which was actually implemented. After the correction procedure is executed, the constraint conditions are satisfied exactly and all error lies within the constraint space. At the convergence factor $K_s = 0.005$ utilized most frequently in processing data for this report, the mean angle of error is less than 18°. This angle is tolerable, although more than desired. The result of the correction procedure actually employed is almost to double the angle of error within the constraint space. In effect, one bit in the filter-weight representation is lost when the mean angles of Figure III-28 are compared with those of Figure III-27.

In view of the very small error angles at high convergence rates, the drop in noise reduction from 6 dB to 2 dB at $K_s = 0.30$ for one summer noise sample when the data scale factor was reduced from 16 to 1 is almost certainly due to zero $y'(t)$ values and sign differences between $y'(t)$ and $y(t)$.

An  alternative  procedure  which  does  not  add  to  the  error  within  the 
constraint  space  is  to  round  the  value 




TABLE III-9

ESTIMATED MEAN $\bar\theta$ OF THE ANGLE BETWEEN $(X-\bar X)$ AND
$\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K + C/K)$ AS A
FUNCTION OF THE CONVERGENCE FACTOR $K_s$
(USING NOISE DATA FROM DAY 238 OF 1970)

Convergence     Adaptive Filter      Estimated Mean of the Angle $\theta$
Factor $K_s$    Power (counts²)      Unscaled Data     Data Scaled by 16

0.002           11.0875              36° 35'           36° 31'
0.003           10.4431              27°  5'           26° 58'
0.004           10.0444              21° 26'           21° 16'
0.005            9.7905              17° 43'           17° 31'
0.007            9.4075              13° 16'           12° 59'
0.010            9.0444               9° 47'            9° 23'
0.015            8.6332               7°  4'            6° 29'
0.02             8.3197               5° 45'            5°  0'
0.03             7.7843               4° 34'            3° 35'
0.05             6.9377               3° 45'            2° 27'
0.07             6.2887               3° 28'            2°  0'
0.10             5.6118               3° 18'            1° 41'
0.15             5.0118               3° 11'            1° 28'
0.20             4.7441               3°  9'            1° 22'
0.25             4.6290               3°  7'            1° 18'
0.30             4.5788               3°  7'            1° 16'
0.35             4.5733               3°  6'            1° 15'
0.40             4.5799               (illegible)       1° 14'
0.50             4.6254               (illegible)       1° 13'


FIGURE III-28

ESTIMATED MEAN $\bar\theta$ OF THE ANGLE BETWEEN $(X-\bar X)$ AND
$\mathrm{sgn}[y(t)]\,\mathrm{sgn}[y'(t)]\,(X-\bar X + E_c + D_c/K + C/K)$ AS A
FUNCTION OF CONVERGENCE RATE (USING
NOISE DATA FROM DAY 238 OF 1970)


$$\left\{\left[\sum_{i=1}^{M} a_i(j)\right] - \delta_{j0}\right\} / M$$

to the nearest integer and subtract the resulting integer from each component at the j-th lag. When M is an even integer and the discrepancy is equal to M/2 (modulo M), the value to be subtracted is rounded to the integer with the smallest absolute value to avoid unproductive computations. If M were 6 and the discrepancy were 4, this error-correction procedure would subtract 1 from each component and produce a discrepancy of -2. The squared distance to the constraint plane would fall from 16/6 to 4/6. The other procedure would add 8/6 to the squared error within the constraint plane. In fact, the suggested procedure always results in less error than the one actually utilized unless the discrepancy is congruent (modulo M) with zero or M/2.

The suggested procedure limits the deviation from the constraint conditions to the set of integers with absolute value less than (M+1)/2. Such a small error should only minimally affect the frequency response of the adaptive filter to a signal from the look direction when a 16-bit representation of the filter weights is employed.
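The suggested procedure amounts to a rounding rule with ties resolved toward the smaller absolute value. The sketch below is one possible reading of that rule in integer arithmetic; the function names and the example weights are hypothetical, and the constraint target is taken as an integer number of filter-weight counts.

```python
def rounded_correction(discrepancy, M):
    """Integer nearest to discrepancy / M, with ties rounded toward zero."""
    q, r = divmod(abs(discrepancy), M)    # |discrepancy| = q * M + r, 0 <= r < M
    if 2 * r > M:                         # strictly more than half a step: round away from zero
        q += 1
    return q if discrepancy >= 0 else -q

def apply_correction(weights_at_lag, target):
    """Subtract the rounded correction from every component at one lag."""
    M = len(weights_at_lag)
    discrepancy = sum(weights_at_lag) - target
    c = rounded_correction(discrepancy, M)
    return [w - c for w in weights_at_lag]

# Example from the text: M = 6 and a discrepancy of 4.
weights = [3, 2, 1, 0, -1, -1]            # illustrative integer weights summing to 4
corrected = apply_correction(weights, target=0)
before = sum(weights) ** 2 / len(weights)
after = sum(corrected) ** 2 / len(corrected)
print("subtracted:", rounded_correction(4, 6), "new discrepancy:", sum(corrected))
print("squared distance to the constraint plane:", before, "->", after)
```

Since the same integer is subtracted from every component, the step is perpendicular to the constraint plane and leaves the error within the plane unchanged.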

If  the  suggested  procedure  were  implemented,  the  probability  of  each 
possible  integer  discrepancy  between  the  filter  weights  and  the  constraint  con¬ 
dition  at  any  lag  could  be  described  in  terms  of  a  Markov  chain.  Figure  III-29 
diagrams  the  set  of  possible  integers  and  transitions  between  integers  for  the 
five-channel  case.  It  is  representative  of  the  situation  for  an  odd  number  of 
channels.  Figure  III-30  is  a  similar  diagram  for  the  six-channel  case.  It  is 
representative  of  the  situation  for  an  even  number  of  channels.  Note  that  the 
integers ±M/2 are combined into a single state. Each time the filter update is performed, the sum $\sum_{i=1}^{M} a_i(j)$ shifts by the integer $L_j$ before the error-correction procedure. After the error-correction procedure, the combined

FIGURE III-29

SET OF POSSIBLE INTEGER CONSTRAINT-CONDITION DISCREPANCIES
AND TRANSITIONS BETWEEN DISCREPANCIES FOR THE
FIVE-CHANNEL CASE


FIGURE III-30

SET OF POSSIBLE INTEGER CONSTRAINT-CONDITION DISCREPANCIES
AND TRANSITIONS BETWEEN DISCREPANCIES FOR THE
SIX-CHANNEL CASE


effect of the integer shift and correction is a clockwise trip of $L_j$ steps along the outer ring. In the case of an even number of channels, the symmetric probability distribution of $L_j$ about zero guarantees that the integers -M/2 and M/2 are equally likely after any number of filter updates. The symmetry of $L_j$ about zero also means that the two transition probabilities in the transition matrix corresponding to any pair of states are identical: the Markov transition matrix is symmetric. Since the probability density function of $\sum_{i=1}^{M-1}\delta_i(j)$ is non-zero over the interval from -(M-1)/2 to (M-1)/2, each of the integers $L_j = -1$, $L_j = 0$, and $L_j = 1$ is always possible whenever $M \ge 3$. As a result, all states are possible at any time after (M-1)/2 or M/2 updates. When, as in this case, the probability transition matrix for some specified number of updates has no zero elements, the transition probabilities $P_{kl}$ into the l-th state for an indefinitely large number of updates approach a fixed limit which is the same for all possible states k (Gnedenko, 1962). Consequently, the probability of any state approaches a limit as the number of updates increases indefinitely, so that the probability of any state is practically independent of the state in the remote past. Since the probability distribution for the integers $L_j$ does not depend upon the discrepancy between the filter weights and the constraint conditions, any rotation of the state labels along the outer ring does not change the probability transition matrix for any specified number of updates. This fact means that all states are equally likely in the limit as the number of updates goes to infinity. For an odd number of channels, the limiting probability for each possible integer discrepancy is 1/M. For an even number of channels, it is 1/M for each integer discrepancy with absolute value less than or equal to M/2-1, while it is 1/2M for discrepancies of ±M/2. The variance of this probability distribution about its zero mean is $(M^2-1)/12$ for odd integers, $(M^2+2)/12$ for even integers. When M is 6, the standard deviation from the constraint plane at the j-th lag is 1.77951 filter-weight counts. At the slight cost of allowing this non-cumulative error perpendicular to the constraint space, the mean angle of error $\bar\theta$ within the constraint space can be reduced to the level estimated in Figure III-27.
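The limiting behavior of the chain can be illustrated by iterating the state probabilities on a ring of M states. The sketch below assumes, purely for illustration, the shift distribution that results from rounding the sum of five independent uniform errors (the K = 0 case for six channels); any symmetric distribution that allows shifts of -1, 0 and 1 should give the same limit. The names are hypothetical.

```python
from math import sqrt

def step(dist, shift_pmf):
    """One filter update: move each state's probability around the ring by the shift L."""
    M = len(dist)
    new = [0.0] * M
    for state, p in enumerate(dist):
        for L, p_L in shift_pmf.items():
            new[(state + L) % M] += p * p_L
    return new

def discrepancy_of(state, M):
    """Map a ring state to its integer discrepancy; state M/2 stands for both +M/2 and -M/2."""
    return state if 2 * state <= M else state - M

M = 6
# Assumed shift distribution (illustrative): rounding the sum of five uniform errors.
shift_pmf = {0: 66 / 120, 1: 26 / 120, -1: 26 / 120, 2: 1 / 120, -2: 1 / 120}

dist = [1.0] + [0.0] * (M - 1)            # start exactly on the constraint condition
for _ in range(200):
    dist = step(dist, shift_pmf)

variance = sum(p * discrepancy_of(s, M) ** 2 for s, p in enumerate(dist))
print("limiting state probabilities:", [round(p, 6) for p in dist])
print("standard deviation:", round(sqrt(variance), 5))
```

For even M the probability of the combined state is shared equally between +M/2 and -M/2, which is why each of those discrepancies has limiting probability 1/2M.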

3.  Elimination of the Filter-Weight Roundoff Error

The directional error due to the filter-weight roundoff error vector $D$ and the beamsteer roundoff error vector $E_b$ can be eliminated if the adaptive-filter update equation is implemented in the form

$$A^{new} - A^{old} = \frac{2K_s\,y(t)}{M\,(X-\bar X)^T(X-\bar X)}\;M(X-\bar X) ,$$

where the factor $2K_s\,y(t)/[M\,|X-\bar X|^2]$ common to all M(2N+1) components is computed as the product of an integer times the quantization level $q$ of the filter weights. When the common factor rounds to zero for non-zero values of $y(t)$, however, the filter vector does not move. To prevent this phenomenon from happening frequently, the quantization level $q$ would need to be chosen so that $2K_s/\{q\,M\,E[|X-\bar X|^2]\}$ was close to 1. When $K_s = 0.005$ and $E[|X-\bar X|^2] = 9693$ counts², the required quantization level is $1/5{,}815{,}800 \approx 2^{-22}$. Thus for the noise sample from day 238 of 1970 (where the RMS adaptive-filter output was only 3 counts), 22 bits would be needed in the filter-weight representation. To achieve the same goal for data scaled by 16, 30 bits would be needed. It is obviously more practical to perform the complete computation at each component before carrying out any roundoff procedure.
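The bit counts quoted above follow from the requirement that the common factor be about one quantization step. A minimal sketch of that arithmetic, with hypothetical function names and the bit count taken as the nearest whole power of two:

```python
from math import log2

def required_quantization_level(ks, M, mean_x_sq):
    """Level q for which 2*Ks / (q * M * E[|X - Xbar|^2]) equals 1."""
    return 2.0 * ks / (M * mean_x_sq)

def bits_needed(q):
    """Fractional bits b such that 2**-b is the power of two nearest to q (in the log sense)."""
    return round(-log2(q))

KS, M = 0.005, 6
q_unscaled = required_quantization_level(KS, M, 9693.0)        # unscaled data
q_scaled = required_quantization_level(KS, M, 2481422.0)       # data scaled by 16

print("unscaled: 1/q =", round(1.0 / q_unscaled), "bits =", bits_needed(q_unscaled))
print("scaled  : 1/q =", round(1.0 / q_scaled), "bits =", bits_needed(q_scaled))
```

Scaling the data by 16 multiplies $E[|X-\bar X|^2]$ by 256, so eight more bits are needed.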


F.  SUMMARY

When the data points transmitted from Alaska are multiplied by 16 before processing, the directional error in the vector $(X-\bar X)$ and the relative error of the adaptive filter output $y(t) = X^T A$ are reduced to the point where they are virtually the same as the corresponding errors for floating-point arithmetic. Errors in $(X-\bar X)$ and $y(t)$ are then limited only by digitization error. The gain doubling performed on the ALPA seismometers in summer of 1972 should permit these errors to be halved.

At the convergence rate $K_s = 0.005$ most frequently used in processing data for this report, the filter-weight roundoff error appears to be the dominant source of error. With the constraint-condition error correction procedure actually employed, the mean angle of error in the filter update vector $(A^{new} - A^{old})$ was approximately 18° at $K_s = 0.005$ (ignoring sign reversals in the adaptive filter output). If the alternate constraint-condition error correction procedure (described in part 2 of Subsection E) had been implemented, the mean angle of error in $(A^{new} - A^{old})$ would have been approximately 10°. At convergence rates below $K_s = 0.005$, the filter-weight roundoff error seems large enough to affect adaptive-filter noise reduction relative to beamsteering (see Figures IV-1 and IV-2 in Section IV).


SECTION  IV 


SIGNAL-TO-NOISE IMPROVEMENT IN THE PRESENCE
OF  BACKGROUND  NOISE 

A.  INTRODUCTION 

In  this  section,  the  subject  of  investigation  is  the  ability  of  adaptive 
multichannel  filtering  to  provide  signal-to-noise  ratio  improvement  relative 
to  beamsteering  in  the  presence  of  background  noise.  The  critical  area  of 
concern  is  the  improvement  for  weak  signals  in  the  borderline  detection  range 
(where  the  signal-to-noise  ratio  in  the  beamsteer  output  is  between  6  and  12  dB). 
In this range of signal-to-noise ratios, detection procedures are unable to recognize
the presence of a signal with any consistency. Under these circumstances
it  is  not  possible  to  rely  on  the  standard  procedure  of  freezing  an  adaptive  multi¬ 
channel  filter  set  when  a  signal  is  detected.  Results  in  this  section  are  obtained 
solely  from  computer  runs  that  implement  the  adaptive  filtering  process  at  each 
point  in  time  when  a  new  data  sample  is  available.  One  signal  approximately 
6  dB  above  the  noise  level  on  the  beamsteer  output  is  used  for  the  critical  re¬ 
sults.  Two  other  signals  with  signal-to-noise  ratios  of  18  and  24  dB  are  pro¬ 
cessed  in  this  way  to  illustrate  the  desirability  of  preventing  the  adaptive  filter 
update  in  the  presence  of  a  signal.  Provided  that  a  signal  of  this  size  is  not 
overlapped  by  an  interfering  event,  any  dire  effects  on  the  signal  due  to  adap¬ 
tive  filtering  are  of  no  serious  concern:  the  signal-to-noise  ratio  on  the  beam¬ 
steer  output  is  sufficiently  high  that  the  beamsteer  output  is  adequate  for  sub¬ 
sequent  analysis  of  the  event. 

The method of approach used to estimate signal-to-noise ratio improvement is to measure the difference between noise reduction and signal degradation. Both noise reduction and signal degradation are computed in terms of the drop in power for the adaptive-filter output relative to the beamsteer output.
Since  it  is  impossible  in  the  case  of  real  data  to  distinguish  between  signal 
power  and  noise  power  when  a  signal  is  present,  the  data  samples  for  signal 
measurements  must  be  different  from  those  used  for  noise  measurements. 

Noise  reduction  is  computed  over  data  samples  approximately  four  hours 
long.  Signal  degradation,  on  the  other  hand,  is  computed  over  four-  to  eight- 
minute  gates.  In  the  case  of  signal  degradation  measurements,  the  effect  of 
adaptive  filtering  on  contaminating  noise  is  lumped  together  with  the  effect  on 
the  signal  proper.  As  long  as  the  adaptive  filter  set  reduces  the  contaminating 
noise  power  at  least  as  much  as  the  signal  power,  the  signal  degradation  is  not 
underestimated  and  the  signal-to-noise  ratio  improvement  is  not  overestimated. 
Noise reduction and signal degradation are calculated for the same pair of data
samples over a range of convergence rates. From these figures, the
signal-to-noise ratio improvement is determined as a function of convergence rate.

At  convergence  rates  where  signal  distortion  and  reliability  of  the  noise 
statistics  are  not  serious  problems,  there  are  two  principal  factors  determining 
adaptive-filter performance. As the convergence rate increases, signal
degradation climbs to a damaging level. Conversely, as the convergence rate
decreases, the ability of the adaptive filter to respond to changes in the noise
field  is  impaired.  The  convergence  rate  at  which  signal-to-noise  ratio  im¬ 
provement  is  maximized  lies  somewhere  in  between. 

Subsection B discusses noise reduction as a function of convergence
rate.  Subsection  C  presents  noise  reduction  as  a  function  of  frequency  for  one 
convergence  rate  close  to  optimum.  Subsection  D  studies  signal  degradation 
as  a  function  of  convergence  rate.  Finally,  the  noise  reduction  and  signal  de¬ 
gradation  results  are  combined  in  Subsection  E  to  yield  signal-to-noise  ratio 
improvement  as  a  function  of  convergence  rate. 


B.  NOISE REDUCTION AS A FUNCTION OF CONVERGENCE RATE


Two  samples,  one  from  day  238  of  1970  and  the  other  from  day  203  of 
1971,  were  used  to  investigate  the  effect  of  convergence  rate  on  adaptive  filter 
noise  reduction  relative  to  beamsteering.  The  look  direction  for  both  samples 
corresponds to an azimuth of 270°. Noise reduction is calculated as the ratio
of beamsteer output power to adaptive-filter output power. It is computed for
the entire band 0.0 to 0.5 Hz and also for the band 0.0234 to 0.0664 Hz
(corresponding to periods between 43 and 15 seconds). Each point on the noise
reduction curves shown in Figures IV-1 and IV-2 was generated by passing
approximately four hours of data through the adaptive filtering package at a
particular convergence rate. The noise reduction shown is for the entire length
of the data samples processed. Convergence rates are defined as percentages
of the convergence rate K_s = 1.
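
The power-ratio measurement described above can be sketched as follows. This is only an illustration under stated assumptions, not the report's own program: the function names, the direct DFT, and the synthetic two-tone traces are hypothetical, and a one-second sample interval is assumed. Only the definition (ratio of beamsteer output power to adaptive-filter output power, in dB, broadband or within the 0.0234 to 0.0664 Hz band) comes from the text.

```python
import cmath
import math


def band_power(x, fs, f_lo, f_hi):
    """Mean-square power of x between f_lo and f_hi (Hz), using a direct DFT."""
    n = len(x)
    total = 0.0
    for k in range(n // 2 + 1):
        f = k * fs / n
        if not (f_lo <= f <= f_hi):
            continue
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        # Interior bins stand for both +f and -f; DC and Nyquist occur once.
        weight = 1.0 if (k == 0 or 2 * k == n) else 2.0
        total += weight * abs(coeff) ** 2
    return total / n ** 2


def noise_reduction_db(beamsteer, adaptive, fs=1.0, band=(0.0, 0.5)):
    """Ratio of beamsteer power to adaptive-filter power in the band, in dB."""
    p_beam = band_power(beamsteer, fs, band[0], band[1])
    p_adapt = band_power(adaptive, fs, band[0], band[1])
    return 10.0 * math.log10(p_beam / p_adapt)


def tone(amplitude, freq_hz, n):
    """Synthetic sinusoid sampled once per second (illustrative data only)."""
    return [amplitude * math.sin(2.0 * math.pi * freq_hz * t) for t in range(n)]


# Hypothetical traces: the "adaptive" output halves the 20-second component
# and leaves the 4-second component untouched.
n = 200
beam = [a + b for a, b in zip(tone(2.0, 0.05, n), tone(1.0, 0.25, n))]
adapt = [a + b for a, b in zip(tone(1.0, 0.05, n), tone(1.0, 0.25, n))]

broadband_db = noise_reduction_db(beam, adapt)
narrowband_db = noise_reduction_db(beam, adapt, band=(0.0234, 0.0664))
```

In this synthetic case the narrowband figure exceeds the broadband one because the reduction is confined to the 43-to-15-second band, which mirrors the qualitative pattern in the tables that follow.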

The  first  data  sample  to  be  run  through  the  adaptive  filtering  package 
covers  the  period  0757  to  1150  on  day  238  of  1970.  The  vertical  components 
of sites 1, 2, 3, 6, 8, and 9 from the ALPA array were used as input channels
for the adaptive-filter beam. The PDE bulletin lists no events between 0639
and 1502 on August 26 of 1970. A scan of the four-hour sample from 12° to
348° at 24° increments using the Fisher detector (see Subsection V-A) indicates,
however, that a signal reached ALPA at 1021 from an azimuth just under 60°,
possibly from the North Atlantic ridge. (Beam traces steered for 60° show a
frequency-dispersed Love-wave arrival prior to frequency-dispersed Rayleigh-wave
arrivals on the vertical and radial components.) The energy from this
signal  is  only  a  small  fraction  of  the  total  energy  within  the  four-hour  sample 
and  should  have  only  a  minor  effect  on  the  reported  noise  reduction  values. 

Table IV-1 shows noise reduction in dB as a function of convergence rate both
broadband and in the band corresponding to periods between 43 and 15 seconds.
Figure IV-1 is a graph of the noise reduction given in Table IV-1. Maximum
broadband noise reduction was achieved near a convergence rate of 35%. This




TABLE IV-1

ADAPTIVE FILTERING NOISE REDUCTION VERSUS CONVERGENCE
RATE (DAY 238 1970)

Convergence Rate    Broadband Noise    Narrowband Noise
(% of maximum)      Reduction (dB)     Reduction (dB)

      0.2               1.099              1.156
      0.3               1.359              1.439
      0.4               1.521              1.618
      0.5               1.639              1.746
      0.7               1.813              1.936
      1.0               1.983              2.124
      1.5               2.186              2.347
      2.0               2.346              2.526
      3.0               2.635              2.847
      5.0               3.135              3.415
      7.0               3.562              3.911
     10.0               4.056              4.501
     15.0               4.547              5.108
     20.0               4.786              5.413
     25.0               4.892              5.563
     30.0               4.940              5.639
     35.0               4.945              5.666
     40.0               4.939              5.667
     50.0               4.896              5.670

[Figure IV-1 curves: Broadband; Narrowband (15-43 Seconds)]

FIGURE IV-1

NOISE REDUCTION AS A FUNCTION OF CONVERGENCE RATE
DAY 238 OF 1970 (0757 TO 1150 GMT)


fact is probably of little importance, since the correlation between one data
vector X (see page III-1 for a definition) and the next is high. The high
correlation occurs because the highest frequency (0.0625 - 0.125 Hz) at
which there is any significant noise is about one quarter to one eighth of the
Nyquist frequency associated with a one-second sample interval. If one data
vector X were exactly the same as the next, for example, the noise power
would be reduced to $(1 - 2K_s)^2$ of its original value: the update equation

$$A^{new} = A^{old} + \frac{2K_s\,(\bar{X} - X)\,X^T A^{old}}{(\bar{X} - X)^T(\bar{X} - X)}$$

would imply that

$$X^T A^{new} = X^T A^{old}\left[1 + \frac{2K_s\,X^T(\bar{X} - X)}{(\bar{X} - X)^T(\bar{X} - X)}\right]
= X^T A^{old}\left[1 - \frac{2K_s\,(\bar{X} - X)^T(\bar{X} - X)}{(\bar{X} - X)^T(\bar{X} - X)}\right]
= X^T A^{old}\,(1 - 2K_s)$$

and therefore

$$(A^T)^{new} X X^T A^{new} = (A^T)^{old} X X^T A^{old}\,(1 - 2K_s)^2 .$$

This result would also apply to data vectors X containing signals. And so it
is necessary to examine what happens to signals at corresponding convergence
rates.
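
The factor (1 - 2K_s) can be checked numerically with a short sketch. It rests on assumptions that are not stated in this section: the second vector in the update is taken to be X-bar, formed by replacing every channel sample with the average over channels at the same lag, and the update adds the term 2K_s (X-bar - X) X^T A / |X-bar - X|^2. The helper names and the random test data are hypothetical.

```python
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def channel_mean_vector(x, n_channels):
    """ASSUMED form of X-bar: each entry is replaced by the mean over the
    channels at the same lag (entries are stored lag by lag)."""
    x_bar = []
    for start in range(0, len(x), n_channels):
        block = x[start:start + n_channels]
        x_bar.extend([sum(block) / n_channels] * n_channels)
    return x_bar


def update(a_old, x, x_bar, k_s):
    """One step of the assumed update:
    A_new = A_old + 2 K_s (X_bar - X) (X^T A_old) / |X_bar - X|^2."""
    d = [xb - xi for xb, xi in zip(x_bar, x)]
    scale = 2.0 * k_s * dot(x, a_old) / dot(d, d)
    return [a + scale * di for a, di in zip(a_old, d)]


rng = random.Random(0)
n_channels, n_lags, k_s = 6, 4, 0.005
x = [rng.gauss(0.0, 1.0) for _ in range(n_channels * n_lags)]
a_old = [rng.gauss(0.0, 1.0) for _ in range(n_channels * n_lags)]

a_new = update(a_old, x, channel_mean_vector(x, n_channels), k_s)

# If the next data vector equals x, the output amplitude is scaled by
# (1 - 2 K_s) and the output power by (1 - 2 K_s)^2.
amplitude_ratio = dot(x, a_new) / dot(x, a_old)
power_ratio = amplitude_ratio ** 2
```

The check works because, under the assumed definition, X-bar is orthogonal to (X-bar - X), so X^T (X-bar - X) equals the negative of |X-bar - X|^2.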




The second data sample encompasses the interval 0355 to 0725 on day
203 of 1971. Sites 8, 13, 14, 15, 16, and 17 of the ALPA array were input to
the adaptive-filter beam. The PDE bulletin reports some activity in the New
Ireland and Northern Celebes regions of the South Pacific during this period
(notably a magnitude 5.4 event from 0.4N, 123.5E at 06:07:52.8 GMT). Since
the azimuth from which this energy arrives is extremely close to 270° (the
look direction), the data sample contains signal energy. The relative strength
of energy from seismic events in this data sample is discussed later in
Subsection B of Section VI. Apparently a sizeable fraction of the total energy is
due to seismic events (both on-azimuth and off-azimuth). Table IV-2 and
Figure IV-2 give the noise reduction in dB for this sample as a function of
convergence rate in the frequency bands 0.0-0.5 Hz and 0.0234-0.0664 Hz.
Maximum broadband noise reduction occurs around a 25-30% convergence
rate. Note that the broadband noise reduction drops abruptly between a 32%
and 35% convergence rate, whereas the narrowband reduction stays more or
less level all the way to a 50% convergence rate. The reason for this drop
is that the adaptive filter set temporarily diverged over a short section of
data containing a glitch. The result of the divergence was a rapid oscillation
in the adaptive filter output at the folding frequency. The folding frequency
does not show up in the narrowband analysis, and hence the narrowband noise
reduction remains relatively constant beyond a 32% convergence rate.

Note that the noise reduction in both these samples begins to arc downward
slightly as the convergence rate decreases from K_s = 0.5% to K_s = 0.2%.
The effects of rounding the filter weights to the nearest computer count are
probably becoming evident. A less plausible explanation is that the adaptive
filter set utilizes a sizeable fraction of the total four-hour sample before it
completes the transition from the initial beamsteer weights to a filter set
reflecting the statistics of the noise field.


TABLE IV-2

ADAPTIVE FILTERING NOISE REDUCTION VERSUS CONVERGENCE
RATE (DAY 203 1971)

Convergence Rate    Broadband Noise    Narrowband Noise
(% of maximum)      Reduction (dB)     Reduction (dB)

      0.2               1.705              1.828
      0.3               2.044              2.181
      0.4               2.275              2.421
      0.5               2.442              2.596
      0.7               2.686              2.852
      1.0               2.937              3.123
      1.5               3.207              3.429
      2.0               3.420              3.679
      3.0               3.782              4.122
      5.0               4.362              4.826
      7.0               4.793              5.335
     10.0               5.246              5.864
     15.0               5.611              6.293
     20.0               5.764              6.481
     25.0               5.814              6.547
     30.0               5.807              6.561
     32.0               5.806              6.562
     33.0               5.732              6.557
     35.0               5.494              6.558
     40.0               5.393              6.537
     50.0               5.281              6.498



Both of the noise samples studied so far were taken from summertime
(when  the  noise  is  at  its  lowest).  In  Subsection  C,  noise  reduction  as  a  function 
of  frequency  for  a  sample  from  day  7  of  1972  is  discussed  briefly.  Also  in 
Section  VI,  where  noise  reduction  is  examined  as  a  function  of  steer  direction, 
a  data  sample  near  the  peak  autumn  noise  level  is  processed.  These  two 
samples,  one  from  winter  and  one  from  fall,  give  some  idea  of  the  reduction 
of  background  noise  achievable  later  in  the  year. 

C.  NOISE  REDUCTION  AS  A  FUNCTION  OF  FREQUENCY 

To  determine  the  frequencies  at  which  adaptive  filtering  was  producing 
noise  reduction,  power  density  spectra  for  the  two  beams  were  computed. 
Spectra  in  this  report  are  given  in  dB  relative  to  one  millimicron  squared 
per  Hz  at  frequencies  where  the  prefilter  response  is  unity.  If  square-root- 
of-N  noise  reduction  had  been  achieved  through  beamsteering,  equivalent 
single-sensor noise levels would be 7.8 dB (i.e., 10 log10 6) higher than the
beamsteer  power  density  spectra. 
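
As a quick check of the 7.8 dB figure: square-root-of-N noise reduction in amplitude corresponds to a factor of N in power, and six channels are summed in each beam. The function name below is illustrative only.

```python
import math


def sqrt_n_gain_db(n_sensors):
    """Power gain in dB when uncorrelated noise is averaged over n_sensors."""
    return 10.0 * math.log10(n_sensors)


six_channel_gain_db = sqrt_n_gain_db(6)
```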

Figure  IV-3  presents  output  spectra  of  the  conventional  and  adaptive- 
filter  beams  for  the  noise  sample  from  day  238  of  1970.  The  spectrum  rises 
to a peak of 26.5 dB at a period of 18 seconds and descends rapidly from this
peak as the frequency increases. Figure IV-4 is a graph of the noise reduction
from adaptive filtering at a convergence rate 0.5% of maximum. Noise reduction
is measured as the ratio of the beamsteer output power density to the
adaptive-filter output power density. Peak noise reduction of 3.0 dB is
obtained at a frequency corresponding to a period of 20 seconds. Noise reduction
of 1 dB is maintained at almost all periods above 13 seconds. Positive
noise reduction is preserved at all frequencies where there is significant
power.

Corresponding beam output spectra for the noise sample from day 203
of 1971 are plotted in Figure IV-5. The adaptive-filter convergence rate was
again 0.5% of maximum. The beamsteer spectrum rises to a peak with
periods in the range 2.-26 seconds. At higher frequencies, the spectral level
drops significantly. Figure IV-6 gives the adaptive-filtering noise reduction
relative to beamsteering. In this sample, peak noise reduction of 4.3 dB
occurs at a period of 21 seconds. Once again, positive noise reduction is
maintained at all frequencies with significant power. Due to the presence
of a glitch and on-azimuth events in the data, some caution should be exercised
in the interpretation of the results (particularly in connection with the
three major peaks in noise reduction). Possibly the reduction in the noise
proper is comparable to that for day 238 of 1970.

FIGURE IV-3

BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 238 1970, STEER DIRECTION 270°, K_s = 0.005)
[Axes: Power Density (dB) versus Period (Seconds)]

FIGURE IV-4

ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION
OF FREQUENCY (DAY 238 1970, STEER
DIRECTION 270°, K_s = 0.005)
[Horizontal axis: Period (Seconds)]

FIGURE IV-5

BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 203 1971, STEER DIRECTION 270°, K_s = 0.005)
[Axes: Power Density (dB) versus Period (Seconds) and Frequency (Hz)]

Another noise sample from day 7 of 1972 was processed. It was atypical
because the spectrum peaked at 7 1/2 seconds: the principal microseismic
peak at 17 seconds was 28.5 dB relative to one millimicron squared
per Hz, whereas the secondary microseismic peak was 32 dB at 7 1/2 seconds.
Highest noise reduction was 3.5 dB at 17 1/2 seconds. In contrast, the noise
reduction at 7 1/2 seconds was only 2.2 dB. At a convergence rate 0.5% of
maximum, broadband noise reduction was 1.8 dB, narrowband noise reduction
2.1 dB.

Evidently noise reduction is greatest at the principal spectral peaks of
the ALPA noise. After the prefilter of Figure II-1 is applied to the
vertical-component data, these peaks are (1) the principal microseismic peak near
18 seconds, and (2) the secondary microseismic peak near 8 seconds.


D.  SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE


To probe the effects of convergence rate on adaptive-filter signal
degradation, three signals were selected out of a four-hour sample from day
276 of 1971 (October 3). The sample spans the time period 2000 to 2357.
Sites 8, 9, 12, 15, 16, and 17 are input to the time-shift-and-sum and
adaptive-filter beams. In this analysis, the adaptive filter is permitted to
update under all conditions and is never frozen.




The critical signal arrived at ALPA at approximately 2140. The
signal-to-noise ratio on the beamsteer output is about 6 dB. No event
corresponding to this signal can be found on either the PDE bulletin or LASA
bulletin. Various forms of frequency-wavenumber spectra computed at several
frequencies peak at azimuths ranging from 299° to 306°. A Fisher-detector
scan of this signal, moreover, shows a higher detector output at
300° than at the two neighboring look azimuths of 276° and 324°. The beamsteer
and adaptive-filter beams are aimed toward 302.5°. Signal degradation
for this event is computed at convergence rates from 0.2% to 25% of
maximum over a 512-second gate. Table IV-3 and Figure IV-7 depict signal
degradation as a function of convergence rate both broadband and in the
frequency band associated with periods between 43 and 15 seconds. As the
convergence rate increases beyond 0.5%, the narrowband degradation rises
above the broadband degradation. The negative degradation at the 0.2% rate
means that the adaptive filter trace contains more energy than the beamsteer
output trace over the 512-point gate. (As a check on the measurement
technique, the convergence rate was set equal to zero, and the result was a power
difference no larger than 0.005 dB.) Figure IV-8 shows the beamsteer and
adaptive-filter output for this event at a convergence rate of 0.5%. Tick
marks at the zero levels of both beams are spaced 5 minutes apart and
correspond to the times given midway between the traces. The beamsteer
output is shown in the top trace, the adaptive-filter output in the bottom trace.

The first of the two large signals examined in this section is an event
listed on the PDE bulletin. This earthquake from near the east coast of
Kamchatka has a bodywave magnitude of 4.5. The origin time is 20:54:48.5
(again on day 276 of 1971), the latitude 55.7 N, the longitude 162.1 E. The
azimuth with respect to ALPA is 273°. Accordingly, the beams are trained
in this direction. The signal-to-noise ratio on the beamsteer output is 18 dB.
Signal degradation is computed over a 256-second gate at convergence rates

TABLE IV-3

ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE
RATE FOR A WEAK SIGNAL FROM 300°-305° (DAY 276 1971)

Convergence Rate    Broadband Signal    Narrowband Signal
(% of maximum)      Degradation (dB)    Degradation (dB)

      0.2              -0.024              -0.039
      0.3               0.104               0.093
      0.4               0.255               0.251
      0.5               0.408               0.408
      0.7               0.650               0.655
      1.0               0.945               0.956
      1.5               1.301               1.320
      2.0               1.569               1.599
      3.0               1.970               2.031
      5.0               2.562               2.686
      7.0               3.049               3.229
     10.0               3.713               3.948
     15.0               4.505               4.781
     20.0               4.973               5.283
     25.0               5.176               5.526


FIGURE IV-7

SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE RATE
FOR A WEAK SIGNAL APPROXIMATELY 6 dB ABOVE NOISE
LEVEL ON BEAMSTEER OUTPUT (STEER AZIMUTH 302.5°)
[Horizontal axis: Convergence Rate K_s]


[Figure IV-8 traces: Beamsteer Output (top); Adaptive Filter Output (bottom)]

FIGURE IV-8

WEAK EVENT FROM 300°-305°
(FILTER ADAPTING, STEER DIRECTION 302.5°, K_s = 0.005)


varying from 0.2% to 25%. The resulting signal degradation figures are
portrayed in Table IV-4 and Figure IV-9. For convergence rates up to 10%, the
signal degradation for this stronger event is greater than before. At a 5%
convergence rate, for example, the signal degradation for this event is about
0.9 dB greater than for the event from 300° to 305°.

The last signal used to measure signal degradation is a magnitude
4.9 event on the same day from almost exactly the same location. The PDE
bulletin gives the origin time as 21:54:12.9, the latitude as 55.8N, and the
longitude as 162.2E. As before, the beams are directed toward a 273°
azimuth. For this event, the signal-to-noise ratio on the beamsteer output
is close to 24 dB. Signal degradation over a 256-point interval is again
calculated for convergence rates from 0.2% to 25%. Resultant signal degradation
is represented in Table IV-5 and Figure IV-10. It is noticeably higher than
for the event with an 18-dB signal-to-noise ratio. Figure IV-11 pictures the
beamsteer output and adaptive-filter output for this tremor when the
convergence rate is 0.5%. At that rate, degradation is just under 2 dB. The worst
distortion occurs at the end of the event.

Two features in the signal degradation curves for the larger events are
worth  mentioning.  As  the  signal-to-noise  ratio  of  these  events  increases,  so 
does  the  signal  degradation.  With  the  particular  adaptive  algorithm  employed, 
it  is  abundantly  clear  that  the  maximum-likelihood  constraints  do  not  guarantee 
that  the  signal  is  preserved.  Furthermore,  the  signal  degradation  begins  to 
approach  a  maximum  at  lower  convergence  rates  than  does  the  noise  reduction. 

E.  SIGNAL-TO-NOISE  GAIN  AS  A  FUNCTION  OF  CONVERGENCE  RATE 

In  this  subsection,  signal  degradation  is  subtracted  from  noise  reduc¬ 
tion  to  yield  signal-to-noise  ratio  improvement.  This  procedure  is  carried 
out  over  a  range  of  convergence  rates  in  order  to  discover  the  convergence 
rates  where  the  largest  signal-to-noise  gain  is  achieved. 
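
The subtraction can be illustrated with the first six broadband entries of Table IV-1 (noise reduction, day 238 of 1970) and Table IV-3 (signal degradation, weak signal). The variable and function names are illustrative; the numbers are the tabulated values, and the differences reproduce the broadband column of Table IV-6.

```python
# Convergence rates in percent of maximum.
rates_percent = [0.2, 0.3, 0.4, 0.5, 0.7, 1.0]

# Broadband noise reduction in dB (Table IV-1, day 238 of 1970).
noise_reduction_db = [1.099, 1.359, 1.521, 1.639, 1.813, 1.983]

# Broadband signal degradation in dB (Table IV-3, weak signal from 300-305 deg).
signal_degradation_db = [-0.024, 0.104, 0.255, 0.408, 0.650, 0.945]


def snr_gain_db(reduction, degradation):
    """Signal-to-noise gain = noise reduction minus signal degradation (dB)."""
    return [round(r - d, 3) for r, d in zip(reduction, degradation)]


gain_db = snr_gain_db(noise_reduction_db, signal_degradation_db)

# Rate at which the tabulated broadband gain is largest.
best_rate_percent = rates_percent[gain_db.index(max(gain_db))]
```

For these rows the largest broadband gain falls at the 0.4% rate, with the 0.3% and 0.5% rates within a few hundredths of a dB of it.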




TABLE IV-4

ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE
RATE FOR A STRONG SIGNAL FROM KAMCHATKA (DAY 276 1971)

Convergence Rate    Broadband Signal    Narrowband Signal
(% of maximum)      Degradation (dB)    Degradation (dB)

      0.2               0.269               0.268
      0.3               0.416               0.410
      0.4               0.556               0.544
      0.5               0.693               0.674
      0.7               0.944               0.915
      1.0               1.279               1.236
      1.5               1.743               1.684
      2.0               2.137               2.070
      3.0               2.718               2.650
      5.0               3.423               3.369
      7.0               3.775               3.735
     10.0               3.989               3.958
     15.0               4.119               4.085
     20.0               4.203               4.170

FIGURE IV-9

SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE
RATE FOR A SIGNAL 18 dB ABOVE NOISE LEVEL
ON BEAMSTEER OUTPUT
[Horizontal axis: Convergence Rate; curve label: Broadband]


TABLE IV-5

ADAPTIVE FILTERING SIGNAL DEGRADATION VERSUS CONVERGENCE
RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (DAY 276 1971)


FIGURE IV-10

SIGNAL DEGRADATION AS A FUNCTION OF CONVERGENCE
RATE FOR A SIGNAL 24 dB ABOVE NOISE LEVEL
ON BEAMSTEER OUTPUT
[Vertical axis: Signal Degradation (dB)]


FIGURE IV-11

MAGNITUDE 4.9 EVENT FROM KAMCHATKA
(FILTER ADAPTING, STEER DIRECTION 273°, K_s = 0.005)


Since the weak signal from 300°-305° has a signal-to-noise ratio of
6 dB on the beamsteer output and lies within the borderline detection range,
it is the source of the most meaningful results. Noise reduction measurements
from day 238 of 1970 and day 203 of 1971 are used to estimate the
signal-to-noise gain which would have occurred on those days if the weak
signal had arrived at ALPA during either of the two noise samples. Tables
IV-6 and IV-7 give the signal-to-noise gain for this event using the noise
reduction measurements from day 238 of 1970 and day 203 of 1971, respectively.
Figure IV-12 contains the results for both days. Narrowband gain is at the
top in each pair of curves. Best signal-to-noise ratio improvement is
obtained near a convergence rate 0.5% of maximum. At that rate, signal-to-noise
gain is about 1.3 dB for noise from day 238 of 1970, about 2.1 dB for
"noise" from day 203 of 1971. Since the signal degradation approaches the
same order of magnitude as the noise reduction at higher convergence rates,
it becomes increasingly difficult to determine what proportion of the remaining
adaptive filter output is from the weak signal. The high ratio of noise to
signal in this event further complicates the situation at higher convergence
rates. As a result, the curves in Figure IV-12 become more and more
questionable as they sweep from left to right. An examination of the
adaptive-filter output trace is useful in estimating the attenuation of the signal proper
at higher convergence rates. Figure IV-13 exhibits the beamsteer and adaptive
filter output for the critical weak event when the convergence rate is 25%
and the filter set is being updated. It appears that the signal alone has been
knocked down by an amount consistent with the stated values in Table IV-3.
Consequently, more confidence can be placed in the results derived solely from
power measurements: for this signal, at least, maximum signal-to-noise
gain does indeed occur near a convergence rate of 0.5%. Figure IV-13 also
illustrates an artifact created by the high convergence rate. Toward the end
of the adaptive-filter output in the bottom trace, there is a noticeable
disturbance which is missing in the beamsteer output and the adaptive-filter output




TABLE IV-6

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A WEAK SIGNAL FROM 300°-305° (USING NOISE REDUCTION
MEASUREMENTS FROM DAY 238 OF 1970)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.2               1.123               1.195
      0.3               1.255               1.346
      0.4               1.266               1.367
      0.5               1.231               1.338
      0.7               1.163               1.281
      1.0               1.038               1.168
      1.5               0.885               1.027
      2.0               0.777               0.927
      3.0               0.665               0.816
      5.0               0.573               0.729
      7.0               0.513               0.682
     10.0               0.343               0.553
     15.0               0.042               0.327
     20.0              -0.187               0.130
     25.0              -0.284               0.037


TABLE IV-7

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A WEAK SIGNAL FROM 300°-305° (USING NOISE REDUCTION
MEASUREMENTS FROM DAY 203 OF 1971)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.2               1.729               1.867
      0.3               1.930               2.088
      0.4               2.020               2.170
      0.5               2.034               2.188
      0.7               2.036               2.197
      1.0               1.992               2.167
      1.5               1.906               2.109
      2.0               1.851               2.080
      3.0               1.812               2.091
      5.0               1.800               2.140
      7.0               1.744               2.106
     10.0               1.533               1.916
     15.0               1.106               1.512
     20.0               0.791               1.198
     25.0               0.638               0.984

FIGURE IV-12

SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE
FOR A WEAK SIGNAL APPROXIMATELY 6 dB ABOVE NOISE
LEVEL ON BEAMSTEER OUTPUT (STEER AZIMUTH 302.5°)
[Vertical axis: Gain (dB)]


[Figure IV-13 traces: Beamsteer Output (top); Adaptive Filter Output (bottom);
SEGMENT 24 (276,21,40,03); SEGMENT 25 (276,21,45,03)]

FIGURE  IV- 13 

WEAK  EVENT  FROM  300°-305° 

(FILTER  ADAPTING,  STEER  DIRECTION  302.5°,  Kg  =  ( 




at a convergence rate of 0.5% (see Figure IV-8). Apparently the
adaptive-filtering process is attempting to use filtered noise in order to cancel the
signal which has just died away.

In  the  remainder  of  this  subsection,  the  two  strong  Kamchatka  signals 
are treated in tandem to demonstrate the greater signal degradation and
correspondingly lower signal-to-noise ratio improvement for signals with
increasingly higher signal-to-noise ratios. It is again worth emphasizing that first
the  beamsteer  output  is  available  for  subsequent  analysis  when  adaptive 
filtering  spoils  a  strong  signal  and  second,  if  necessary,  the  situation  can  be 
alleviated  simply  by  freezing  the  adaptive  filter  set  upon  detection  of  a  signal. 
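
A filter freeze of this kind might be sketched as below. This is a toy illustration under assumptions, not the processing package used in the study: the update form and the channel-mean vector X-bar are assumed, `signal_flags` stands in for the output of some detector, and the two-channel, single-lag data are invented for the example.

```python
def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def run_beam(data_vectors, mean_vectors, a_init, k_s, signal_flags):
    """Form the filter output for each data vector; skip the weight update
    whenever the (hypothetical) detector flag says a signal is present."""
    a = list(a_init)
    outputs = []
    for x, x_bar, signal_present in zip(data_vectors, mean_vectors, signal_flags):
        y = dot(x, a)
        outputs.append(y)
        if signal_present:
            continue  # filter frozen: keep the current weights
        d = [xb - xi for xb, xi in zip(x_bar, x)]
        dd = dot(d, d)
        if dd > 0.0:
            # Assumed update: A <- A + 2 K_s (X_bar - X) y / |X_bar - X|^2
            a = [ai + 2.0 * k_s * y * di / dd for ai, di in zip(a, d)]
    return outputs, a


# Invented two-channel, single-lag data; each mean vector repeats the
# channel average of the corresponding data vector.
vectors = [[1.0, 3.0], [4.0, 0.0], [2.0, 5.0]]
means = [[2.0, 2.0], [2.0, 2.0], [3.5, 3.5]]
beamsteer_weights = [0.5, 0.5]

_, frozen_weights = run_beam(vectors, means, beamsteer_weights, 0.005, [True] * 3)
_, adapted_weights = run_beam(vectors, means, beamsteer_weights, 0.005, [False] * 3)
```

In this sketch the weights still sum to one after adapting, because every update direction (X-bar - X) sums to zero across channels, while the frozen run leaves the initial beamsteer weights untouched.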

Tables IV-8 and IV-9 present the signal-to-noise ratio improvement
using noise measurements from day 238 of 1970 for the earlier 18 dB signal
and the later 24 dB signal, respectively. Figure IV-14 combines the results
given in both tables. Corresponding results for day 203 of 1971 are contained
in Tables IV-10 and IV-11 and Figure IV-15. On both days signal degradation
is much worse for the later and stronger event. Positive improvement for the
larger signal is achieved only at extremely low convergence rates. Maximum
improvement occurs near a convergence rate of 0.2%. There broadband and
narrowband signal-to-noise gain for the larger event are 0.136 dB and 0.208
dB, respectively, for day 238 of 1970, and 0.742 dB and 0.880 dB,
respectively, for day 203 of 1971. The signal-to-noise ratio improvement values
for the smaller Kamchatka signal are more in line with those of the 6-dB
signal from 300°-305°: the best results are near a convergence rate 0.5%
of maximum. For both the strong Kamchatka signals, the worst improvement
over beamsteering occurs at a 5% convergence rate (unlike the weaker event
from 300°-305°).

An  interesting  phenomenon  happens  as  the  convergence  rate  increases 
above  5%:  the  gain  for  the  two  Kamchatka  earthquakes  begins  to  rise  again. 
Narrowband  signal-to-noise  ratio  improvement  actually  exceeds  that  achieved 




TABLE IV-8

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A STRONG SIGNAL FROM KAMCHATKA (USING NOISE
REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.2               0.830               0.888
      0.3               0.943               1.029
      0.4               0.965               1.064
      0.5               0.946               1.072
      0.7               0.869               1.021
      1.0               0.704               0.888
      1.5               0.443               0.663
      2.0               0.209               0.456
      3.0              -0.083               0.197
      5.0              -0.288               0.046
      7.0              -0.213               0.176
     10.0               0.067               0.543
     15.0               0.428               1.023


TABLE IV-9

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (USING
NOISE REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.2               0.136               0.208
      0.3               0.035               0.134
      0.4              -0.134              -0.015
      0.5              -0.338              -0.208
      0.7              -0.758              -0.614
      1.0              -1.377              -1.229
      1.5              -2.264              -2.135
      2.0              -2.948              -2.845
      3.0              -3.732              -3.677
      5.0              -4.044              -4.002
      7.0              -3.856              -3.782
     10.0              -3.405              -3.263
     15.0              -2.918              -2.692
     20.0              -2.709              -2.437
     25.0              -2.605              -2.299




FIGURE IV-14

SIGNAL-TO-NOISE GAIN AS A FUNCTION OF CONVERGENCE RATE
FOR TWO STRONG KAMCHATKA SIGNALS (USING NOISE
REDUCTION MEASUREMENTS FROM DAY 238 OF 1970)
[Vertical axis: Signal-to-Noise Gain (dB)]


TABLE IV-10

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A STRONG SIGNAL FROM KAMCHATKA (USING NOISE
REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.4                                   1.877
      0.5                                   1.922
      0.7                                   1.937
      1.0                                   1.887
      1.5                                   1.745
      2.0                                   1.609
      3.0                                   1.472
      5.0                                   1.457
      7.0                                   1.600
     10.0                                   1.906
     15.0                                   2.208
     20.0                                   2.333




TABLE IV-11

ADAPTIVE FILTERING SIGNAL-TO-NOISE GAIN VERSUS CONVERGENCE
RATE FOR A VERY STRONG SIGNAL FROM KAMCHATKA (USING
NOISE REDUCTION MEASUREMENTS FROM DAY 203 OF 1971)

Convergence Rate    Broadband           Narrowband
(% of maximum)      Signal-to-Noise     Signal-to-Noise
                    Gain (dB)           Gain (dB)

      0.2               0.742               0.880
      0.3               0.720               0.876
      0.4               0.620               0.788
      0.5               0.465               0.642
      0.7               0.115               0.302
      1.0              -0.423              -0.320
      1.5              -1.243              -1.053
      2.0              -1.874              -1.692
      3.0              -2.585              -2.402
      5.0              -2.817              -2.591
      7.0              -2.625              -2.358
     10.0              -2.215              -1.900
     15.0              -1.854              -1.507
     20.0              -1.731              -1.369
     25.0              -1.683              -1.315



near  0.  5%  in  the  case  of  the  18-dB  Kamchatka  signal.  The  rise  in  gain  is 
due  to  a  flattening  of  signal  degradation  at  convergence  rates  below  those 
where  maximum  noise  reduction  is  obtained  (see  Figures  IV-9  and  IV- 10). 
This  leveling  off  contrasts  with  signal  degradation  which  is  still  climbing  at 
a  25%  convergence  rate  in  the  case  of  the  weaker  6-dB  signal  from  300°-305° 
(see  Figure  IV-7).  At  any  rate,  there  does  not  seem  to  be  any  way  to  exploit 
the  extra  noise  reduction  derived  from  transient  correlation  between  succes¬ 
sive  data  vectors.  Such  noise  reduction  would  cease  abruptly,  anyhow,  if 
the  filter  were  frozen  and  the  adaptive -filtering  process  could  no  longer 
track short-term fluctuations in the data. In addition, severe signal distortion
is a compelling reason for avoiding the higher convergence rates.

The  next  section  of  this  report  discusses  the  effects  of  stopping  the 
filter  update  in  an  attempt  to  eliminate  the  severe  degradation  of  easily  de¬ 
tected  signals. 




SECTION  V 

EFFECT  OF  FILTER  FREEZE  ON  SIGNAL-TO-NOISE 
RATIO  IMPROVEMENT 


A.  GENERAL  DISCUSSION 

The maximum-likelihood adaptive-filtering equations are designed to
preserve a signal by constraining the filter set to have unity response in the
signal direction while optimally reducing the total filter output power subject
to the unity-response constraint. When no signal is present, the adaptive
filter set optimally reduces the filtered noise power subject to the constraint
conditions. There are no inherent design problems unless a signal is present.
If a signal is present, however, the adaptive filter set will attempt to minimize
the total output power by manipulating the filtered noise output so as to
cancel as much of the filtered signal as possible while still observing the
constraint conditions. The constraint conditions are a significant but not
insurmountable barrier to signal cancellation. In the particular form of the update
algorithm used in this study, furthermore, the step size of the filter update
vector increases without limit as the time-shifted input data channels become
more and more alike: the squared magnitude of the filter change vector was
shown in Subsection E of Section III to be


\[ |A^{new} - A^{old}|^2 = \frac{4 K_s^2 \, y^2(t)}{|X - \bar{X}|^2} \]

where y(t) is the adaptive filter output; as the data vector X approaches the
beamsteer vector $\bar{X}$ (see Section III for the definitions), the step size
approaches infinity. (In practice, the step size is limited by
contaminating noise, differences in instrument response, and signal
propagation across the array not consistent with the signal model.) To prevent
signal cancellation and filter instability, the adaptive filter set is frozen
when a signal is detected. After the signal has died out, the filter set is
again allowed to adapt with the aim of reducing the output power of a changed
noise field.

A simple power detector was originally tried as a detection algorithm.
A running power average was computed for the adaptive-filter output beam.
Each point of the beam was checked to see whether its square exceeded some
arbitrary multiple of the running power average for the beam. There were
two serious problems with this method of detection:

•  The running power average tended to increase with time, even
when it was not updated after a signal detection. The time
required to fall from a peak level was greater than the time
required to climb to that peak level. Signals just below the
detection threshold then drove the running power average still
higher (and thereby raised the detection threshold).

•  Noise fluctuations and glitches produced frequent false alarms
when the threshold was set low enough to detect signals clearly
visible on the beam output trace.

A detection algorithm was chosen specifically to eliminate excessively
large jumps in the adaptive filter vector. For a fixed convergence rate $K_s$,
the step size is proportional to $|y(t)| / |X - \bar{X}|$. When this ratio is large, it
is likely that a signal is present. To eliminate the dependence of this ratio
on the filter weights, the adaptive filter output $y(t) = X^T A$ is replaced by the
RMS beamsteer output over a 31-point, 31-second gate. The final form of the
detection ratio is


\[ \frac{\bar{X}^T \bar{X}}{(X - \bar{X})^T (X - \bar{X})}
   = \frac{\sum_{i=1}^{M} \sum_{j=-N}^{N} \bar{x}^2(t-j)}
          {\sum_{i=1}^{M} \sum_{j=-N}^{N} \left[ \bar{x}(t-j) - x_i(t-j) \right]^2} \]

where M is the number of channels, 2N+1 is the total number of filter
weights per channel, $\bar{x}(t-j)$ is the beamsteer output at time $t - j\Delta t$, and
$x_i(t-j)$ is the prefiltered vertical-component output for site i at time $t - j\Delta t$.
This ratio is a measure of the similarity between the input channels and the
beamsteer output. As it turns out, the detection ratio is a scaled version of
the Fisher detector $[(M-1)\,\bar{X}^T \bar{X}] / [(X - \bar{X})^T (X - \bar{X})]$. The Fisher detector
has been studied extensively (Melton and Bailey, 1957; Booker, 1965; Edwards,
Benno and Creasey, 1967; Shumway and Husted, 1970; Blandford, 1970;
Shumway, 1971; Wirth, 1971; Smart and Flinn, 1971; Wirth, Blandford and Shumway,
1971; Shumway, 1972; Blandford, 1972; Smart, 1972). In the results which
follow, the threshold value for the detection ratio is 4 (sufficient to detect
signals 12 dB above the noise on the beamsteer output for six sites at ALPA).
Once the detection ratio exceeds 4, the filter set is frozen until 120 successive
detection ratios (or two minutes of data) fall below the threshold.

Subsection B shows what happens to the two large Kamchatka events
from the previous section when the adaptive filter is frozen after a signal
detection. In addition, it illustrates the capability of adaptive multichannel
filtering to suppress off-azimuth events better than the standard
time-shift-and-sum beamforming method. Subsection C tackles the difficult problem of
estimating the loss in noise reduction during the time period when the adaptive
filter is prevented from updating. Subsection D describes an alternate adaptive
algorithm which appears to have definite advantages over the algorithm
actually used. The principal advantage of the suggested algorithm is greatly
reduced signal degradation.
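
The detection ratio and the freeze rule described in this subsection can be
sketched in a few lines of Python. This is an illustrative sketch only, not the
program used in the study. The function names detection_ratio and
freeze_flags, the array layout (channels by gate samples), and the use of the
channel mean as the beamsteer output are assumptions made for the example; the
threshold of 4 and the release count of 120 successive ratios are taken from
the text.

```python
import numpy as np

def detection_ratio(x):
    """Similarity ratio for one gate of time-aligned data.

    x has shape (M, 2N+1): M channels, 2N+1 samples per channel.
    Assumption: the beamsteer output is the mean over channels.
    """
    m = x.shape[0]
    beam = x.mean(axis=0)                 # beamsteer output at each lag
    numerator = m * np.sum(beam ** 2)     # sum over channels and lags of beam^2
    denominator = np.sum((beam - x) ** 2) # misfit between beam and channels
    return numerator / denominator

def freeze_flags(ratios, threshold=4.0, hold=120):
    """Return one boolean per ratio: True while the filter is frozen.

    The filter freezes when a ratio exceeds the threshold and is released
    once `hold` successive ratios have fallen below it.
    """
    frozen = False
    below = 0
    flags = []
    for r in ratios:
        if r > threshold:
            frozen = True
            below = 0
        elif frozen:
            below += 1
            if below >= hold:
                frozen = False
                below = 0
        flags.append(frozen)
    return flags
```

With one ratio per second, hold=120 corresponds to the two minutes of data
quoted above. Multiplying the returned ratio by (M-1) would give the Fisher
detector form quoted in the text.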


B.  EFFECT  OF  FILTER  FREEZE  ON  SIGNALS 

A magnitude 5.9 earthquake from New Guinea on day 7 of 1972 was
selected to test the filter freeze procedure and observe its effects on the
signal. The PDE bulletin gives its origin time as 06:25:48.4 and its location as
(2.1S, 139.0E). The azimuth from ALPA is 253°. Both the beamsteer and
adaptive-filter beams were aimed in this direction to pass energy of 3.5 km/sec
velocity. To prevent clipping of this signal, which was more than 42 dB
above the noise on the beamsteer output, the data was scaled by a factor of
only 4 instead of 16. The adaptive-filter package was run from 0430 to 0826
at a convergence rate of 0.5% using sites 8, 9, 12, 15, 16, and 17 as input to
the adaptive-filter beam.

The similarity-detection algorithm had no difficulty in spotting this
event: the adaptive filter set was frozen several times during the Rayleigh
wave arrivals from this event. The main Rayleigh wave arrived at about
0708 and is shown in Figure V-1. Tick marks at zero level in both beams
are separated by 5 minutes and indicate the times given midway between
traces. The time-shift-and-sum output is at the top, the adaptive-filter
output at the bottom. The Rayleigh-wave arrival for which both beams are steered
is virtually intact. Signal degradation measured over a 512-point gate starting
at the beginning of segment 39 in the figure was just under 1.3 dB both
broadband and in the frequency band associated with periods between 43 and 15
seconds. This event is so strong that the P-wave arrival and other seismic
phases ahead of the Rayleigh wave are visible on the beam outputs. Since no
appreciable degradation occurs for other events during the freeze period when
the Rayleigh wave is the only visible phase, it is likely that the
adaptive-filtering process is rejecting some of the Rayleigh-wave arrival on an azimuthal
basis (using the earlier phases of the event). Another possibility is that the
signal degradation is due to adaptation over the short time interval
immediately before signal detection.


FIGURE V-1

MAIN RAYLEIGH WAVE ARRIVAL FROM 5.9 NEW GUINEA
EVENT (JANUARY 7, 1972)

To observe the effects of the filter freeze on the two Kamchatka
events of the previous section, the same sample as before from day 276 of
1971 was reprocessed with the filter freeze threshold set to 4. The
convergence rate was specified as 0.5% of maximum in the runs which produced
the figures shown in the remainder of this subsection. Several events from
the PDE bulletin and LASA bulletin had arrival times within this period.
Table V-1 gives the PDE events which arrived during the sample interval.
In addition to these events, two events located within three degrees of the
Panama event were included in the LASA bulletin covering this time interval.
The first of these had a measured LASA bodywave magnitude of 3.6 and
arrived at LASA 47 minutes and 34 seconds before the PDE event from Panama.
The second had a measured LASA bodywave magnitude of 3.9 and arrived at
LASA 6 minutes and 38 seconds before the PDE event from Panama.

The adaptive filter beam was steered to pass events from an azimuth
of 273° in order to pick up the two events from the Kamchatka region. Sites
8, 9, 12, 15, 16, and 17 of the ALPA array were used to form the beamsteer
output and adaptive-filter beam.

The first clear arrival at ALPA is shown in Figure V-2. Apparently
it is the magnitude 3.6 Panama event from the LASA bulletin. Although
clearly visible in the beamsteer output, it is almost obliterated in the
adaptive-filter beam. During this event, the adaptive filter set was being updated. This
off-azimuth signal was attenuated so strongly in the adaptive-filter beam
because the adaptive-filter beam reacted to the presence of this event and nulled
it out: a steadily increasing reduction of the power in this event over the first
few cycles can be seen in Figure V-2 on the adaptive-filter trace.


FIGURE V-2

EVENT PROBABLY FROM SOUTH OF PANAMA (LASA m_b = 3.6)

(Traces shown: Beamsteer Output; Adaptive Filter Output)

The first of the two on-azimuth events from Kamchatka is plotted in
Figure V-3. The filter-freeze algorithm prevented the adaptive filter set
from being updated during this event. As a result, signal degradation for
this event was less than 0.1 dB. There is almost no perceptible difference
between the beamsteer and adaptive-filter traces in Figure V-3.

The magnitude 3.9 Panama event from the LASA bulletin did not
appear on either beam.

On the other hand, the magnitude 4.7 Panama event from the PDE
bulletin was clearly evident on the beamsteer output. This event is pictured
in Figure V-4. As in the case of the event 47 minutes earlier, it is
significantly weaker on the adaptive-filter beam. The adaptive filter set was
updated throughout this event.

Figure V-5 displays the weak event from 300°-305° used in the
previous section for signal degradation measurements. In this case, however,
the 273° steer direction transforms this event into an off-azimuth event.
Attenuation is 0.56 dB broadband and 0.57 dB narrowband when the detection
of the first Kamchatka event in Figure V-3 triggers filter-update suppression
over that event. Curiously enough, the attenuation for this weak event from
300°-305° is -0.35 dB broadband and -0.36 dB narrowband when the adaptive
filter is permitted to vary during the preceding Kamchatka event. That is to
say, when the only difference in processing is to permit or to suppress the
adaptive-filter update algorithm during an on-azimuth event 30 minutes earlier,
the resultant attenuation varies by almost 0.9 dB. In both cases, the steer
direction is 273°, the convergence rate is 0.5%, the same sites are utilized,
and the filter set is adapting for at least 20 minutes after the earlier
Kamchatka event has faded away. Figure V-6 presents the same weak off-azimuth
event after allowing the filter set to vary over the previous event from
Kamchatka. This figure does indeed appear to contain a cleaner estimate of the
signal. A possible explanation is that the adaptive-filter beam pattern exceeds


FIGURE V-3

MAGNITUDE 4.5 EVENT FROM KAMCHATKA
(FILTER FROZEN, STEER DIRECTION 273°, K_s = 0.005)


FIGURE V-4

MAGNITUDE 4.7 EVENT FROM SOUTH OF PANAMA


FIGURE V-5

WEAK EVENT FROM 300°-305°
(FILTER ADAPTING, STEER DIRECTION 273°, K_s = 0.005,
FILTER FROZEN DURING EARLIER KAMCHATKA EVENT)


FIGURE V-6

WEAK EVENT FROM 300°-305°
(FILTER ADAPTING, STEER DIRECTION 273°, K_s = 0.005,
FILTER ADAPTING DURING EARLIER KAMCHATKA EVENT)

(Traces shown in each figure: Beamsteer Output; Adaptive Filter Output)

a value of 0 dB at 300°-305° azimuths at the time of arrival of the off-azimuth
event in Figures V-5 and V-6. Array response patterns with maxima away
from the steer direction are often generated by multichannel filtering,
especially when no significant energy is coming from the direction corresponding
to the beam-pattern maximum. If this interpretation is correct, it is still a
moot question whether the deflection of the main lobe to azimuths north of 273°
had already taken place by the end of the Kamchatka event or whether the
characteristics of the intervening Panama event contributed to the end result.

The second Kamchatka event from the PDE bulletin is presented in
Figure V-7. The filter set was frozen during this event. This figure should
be compared with Figure IV-11, where the filter is adapting. Signal
degradation for this on-azimuth event is almost exactly 0 dB. As in the case of
the earlier Kamchatka event, there is almost no difference between the two
beam outputs.

No definite detection could be made for the magnitude 4.8 event from
the New Britain region.

The final event from the PDE bulletin, a magnitude 4.3 event from the
Andreanof Islands (Figure V-8), was detected on both the beamsteer and
adaptive-filter beams. However, the similarity between the beamsteer
output and the input channels was too low to freeze the adaptive filter set. This
earthquake illustrates the effect of adaptive filtering on a strong off-azimuth
signal reasonably close (36°) to the steer direction. Despite the loss of one
input channel and the resultant redistribution of filter weights just before the
principal burst of energy on both traces, attenuation was more than 6 dB
relative to the beamsteer output. The marked attenuation demonstrates the
sometimes-forgotten capability of multichannel filtering to narrow the width
of the main lobe in the array beam pattern: the array becomes a superdirective
antenna.


FIGURE V-7

MAGNITUDE 4.9 EVENT FROM KAMCHATKA
(FILTER FROZEN, STEER DIRECTION 273°, K_s = 0.005)


FIGURE V-8

MAGNITUDE 4.3 EVENT FROM ANDREANOF ISLANDS

(Traces shown in each figure: Beamsteer Output; Adaptive Filter Output)


The following conclusions can be drawn from this four-hour data
sample:

•  On-azimuth signals at ALPA are not significantly attenuated
by adaptive filtering at convergence rates near optimum for
suppression of background noise. Attenuation was 0.4 dB for
the weak signal from 300°-305° (see Table IV-3) with the filter
adapting and steered toward 302.5°. To stop degradation of
stronger signals, the filter must be frozen.

•  Off-azimuth events are strongly suppressed in the adaptive-filter
beam when the filter set is not frozen. Some off-azimuth
signals are virtually annihilated. The more powerful the
off-azimuth event, the more it is smothered.

•  Greater directional resolution at ALPA and other similar
long-period arrays is achievable through the beam-narrowing
capability of multichannel filtering. This fact is extremely
important if long-period arrays of this type are to be used for
signal detection and location or for separation of multiple
events. With time-varying adaptive filters, off-azimuth events
can be nulled out in proportion to their signal-to-noise ratio
with possible complications if two events overlap in time. With
fixed non-varying multichannel filters, superdirectivity can be
preserved in all circumstances, but then the ability to quell
specific bursts of off-azimuth energy in an on-line processing
mode is impaired.

C.  EFFECT  OF  FILTER  FREEZE  ON  NOISE  REDUCTION 

In Section IV, signal-to-noise ratio improvement was calculated by
subtracting signal degradation from noise reduction when the adaptive filter
set is permitted to update each time a new sample of data is available, even
when an on-azimuth signal is detected. In this section, the method of
determining signal-to-noise gain again involves the same two quantities, but in
this case they must be measured so as to reflect the fact that the filter update
has ceased. For signal degradation, the required procedure is relatively
simple and was performed in Subsection B: only in the instance of the 42-dB
signal on day 7 of 1972 was the degradation significant (1.3 dB); the 18-dB
and 24-dB signals on day 276 of 1971 were attenuated less than 0.1 dB. The
measurement of the loss in noise reduction, however, poses some problems.
As the elapsed time increases from the point of the filter freeze, the general
trend is one of ever greater degradation in comparison with an adapting filter
set. There are periods of time, however, when the frozen filter set is better
able to reduce the noise power than the adapting filter set. The superior
performance of the frozen filter set is due to temporary instances where the
characteristics of the noise field revert to a condition more typical of the
period before the filter freeze. An attempt is made to put bounds on the
trend of the drop in noise reduction as a function of time elapsed since the
filter freeze: this attempt is a subjective interpretation of the rapidly
fluctuating measurements of the drop in noise reduction.

To determine the effect on noise reduction of freezing an adaptive filter
set when an on-azimuth signal is detected, a four-hour noise sample from day
232 of 1970 was selected for processing. The time period covered is from
0344 to 0741. No events are reported by the PDE bulletin between 1219 on
August 19 and 0834 on August 20 (day 232). A 360° Fisher-detector scan of
the four-hour sample at 24° azimuthal increments suggests two possible
signal arrivals, one at 0406 from a 252° azimuth and another at 0506 from an
azimuth between 12° and 36°. In the first case, a faint dispersed wavetrain
is visible between 0406 and 0416 on the 270° vertical-component beam. A
trained analyst would probably call it a signal. The second possible arrival
at 0506 is slightly weaker than the first on the Fisher-detector scan. The
only computed beam at 270° is too far away from the arrival direction to
decide whether a signal is present. These two possible signals should have
only minuscule effects on noise-reduction results during the critical final
hour of the data sample.

In processing this sample, sites 3, 4, 5, 6, 8, and 9 are used to form
vertical-component input channels for the beamsteer output and
adaptive-filter beam. Both beams are steered to pass energy arriving from an azimuth
of 270° at a velocity of 3.5 km/sec. The adaptive-filter convergence rate is
set at 0.5% of maximum. The noise sample is processed twice. In both
cases, the filter is permitted to adapt for the first three hours. In the first
computer run, it is allowed to adapt for the remaining hour. In the second
run, it is frozen for the remainder of the run in order to deduce the loss in
noise reduction as a function of the elapsed time since the filter freeze. In
both runs, time is divided into 55 equal 256-second segments. Segments 40-55
correspond to the period in which the adaptive-filtering update process is
suppressed in the second computer run. Table V-2 gives the broadband noise
reduction when the filter is adapting and when it is frozen, together with the
difference between the two modes of operation. These figures are given for
each of the 16 segments, for 8 two-segment intervals, 4 four-segment
intervals, 2 eight-segment intervals, and the entire 16-segment period from 0629
to 0737. Figure V-9 plots the measured loss in noise reduction as a function
of the time since the filter was frozen. Each point in the rightmost column
of Table V-2 is located at the middle of the time interval it designates.
Results are quite variable, so much so that the loss in noise reduction does not
increase monotonically until eight segments are grouped together. The
overall trend, of course, is toward ever greater loss of noise reduction. Figure
V-10 attempts to portray the apparent trend of the drop in noise reduction due
to freezing the adaptive filter set. It is assumed that the loss in noise
reduction climbs monotonically from zero, starting at the point in time where the


TABLE V-2

BROADBAND NOISE REDUCTION AS A FUNCTION OF TIME
FOR AN ADAPTING FILTER SET AND A FROZEN FILTER
SET (USING A DATA SAMPLE FROM DAY 232 OF 1970)

             Broadband Noise Reduction (dB)
Segment      Filter Adapting    Filter Frozen      Loss (dB)

40               2.124              1.811            0.313
41               1.419              1.262            0.157
42               1.191              0.703            0.488
43               1.803              1.150            0.653
44               1.348              1.088            0.260
45               3.305              2.931            0.374
46               0.688             -0.040            0.728
47               3.109              2.625            0.784
48               1.702              1.564            0.138
49               3.249              2.526            0.723
50               0.063             -0.710            0.773
51               1.227              1.736           -0.509
52               0.592             -0.202            0.794
53               2.649              1.742            0.907
54               0.031             -0.906            0.937
55               1.971              1.217            0.754

40-41            1.731              1.507            0.224
42-43            1.422              0.874            0.548
44-45            2.224              1.919            0.305
46-47            1.826              1.079            0.747
48-49            2.552              2.109            0.443
50-51            0.693              0.525            0.168
52-53            1.594                               0.842
54-55            1.092              0.244            0.848

40-43            1.546              1.122            0.424
44-47            2.022              1.482            0.540
48-51            1.693              1.390            0.303
52-55            1.274              0.428            0.846

40-47


FIGURE V-9

MEASURED LOSS IN NOISE REDUCTION AS A FUNCTION
OF TIME ELAPSED SINCE FILTER FREEZE
(USING NOISE FROM DAY 232 OF 1970)


Noise  Reduction  Over  All  16  Segments 


update process ceases. The shaded area indicates the area between the
likely upper and lower bounds for the trend. The dotted curve within the shaded
area is the estimate of the trend. Shown at the top of the figure is the 1.6 dB
noise reduction achieved by the adapting filter set over the interval 0629-0737.

To determine the signal-to-noise gain for a detected event, the signal
degradation with the filter frozen is subtracted from the noise reduction to
obtain the signal-to-noise gain at the time when the filter ceases to vary. For
the day-276 Kamchatka signal 18 dB above noise level on the beamsteer output,
broadband signal degradation was 0.087 dB. Therefore the 1.616 dB noise
reduction over segments 40-55 of the sample from day 232 of 1970 at a
convergence rate of 0.5% would yield a signal-to-noise ratio improvement of 1.529
dB at the time the filter is frozen. From this value, the drop in noise
reduction due to freezing the filter must be subtracted. Figure V-11 shows the
resultant trend of signal-to-noise gain as a function of time elapsed since the
cessation of update using the results of Figure V-10. The gain which would
have been achieved if the filter had not been frozen is 0.923 dB (1.616 dB -
0.693 dB) at a convergence rate 0.5% of maximum. Since the event in
question lasted for less than 10 minutes, it is clearly desirable to suppress the
filter update upon detection of this signal.

Even more dramatic are the results for the later Kamchatka event 24
dB above beamsteer noise level. Broadband signal degradation was -0.004
dB with the filter frozen, so that signal-to-noise ratio improvement falls off
starting at a value of 1.620 dB. This figure contrasts with a signal-to-noise
gain of -0.361 dB (1.616 dB - 1.977 dB) when the filter adapts at a 0.5%
convergence rate.

In summary, when signals reach the signal-to-noise ratio required for
detection, greater signal-to-noise gain is generated by the filter-freeze
procedure. The advantages of preventing the filter update become more and more
dramatic in the case of increasingly strong events.
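
The bookkeeping in this subsection is simple subtraction in decibels, and a
short Python sketch makes the comparison explicit. The function name
snr_gain_db and the break-even quantity computed at the end are introduced
here for illustration only; every numerical input is a value quoted in the
text above.

```python
def snr_gain_db(noise_reduction_db, signal_degradation_db, freeze_loss_db=0.0):
    """Signal-to-noise gain in dB.

    freeze_loss_db is the drop in noise reduction accumulated since the
    filter was frozen (zero at the instant of the freeze, and zero for a
    filter that keeps adapting).
    """
    return noise_reduction_db - signal_degradation_db - freeze_loss_db

NOISE_REDUCTION = 1.616  # dB, segments 40-55, day 232 of 1970, 0.5% rate

# 18-dB Kamchatka signal: degradation 0.087 dB frozen, 0.693 dB adapting
frozen_18 = snr_gain_db(NOISE_REDUCTION, 0.087)
adapting_18 = snr_gain_db(NOISE_REDUCTION, 0.693)

# 24-dB Kamchatka signal: degradation -0.004 dB frozen, 1.977 dB adapting
frozen_24 = snr_gain_db(NOISE_REDUCTION, -0.004)
adapting_24 = snr_gain_db(NOISE_REDUCTION, 1.977)

# Freezing remains the better choice while the accumulated loss in noise
# reduction is smaller than the signal degradation it avoids.
break_even_18 = 0.693 - 0.087
break_even_24 = 1.977 - (-0.004)

for name, value in [("18 dB frozen", frozen_18), ("18 dB adapting", adapting_18),
                    ("24 dB frozen", frozen_24), ("24 dB adapting", adapting_24)]:
    print(f"{name:15s} {value:7.3f} dB")
```

Under these figures the frozen filter stays ahead for the 18-dB event until
the loss in noise reduction reaches about 0.6 dB, and for the 24-dB event
until it reaches about 2 dB, which is more than the 1.6 dB of noise reduction
available in this sample.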


FIGURE V-11

TREND OF SIGNAL-TO-
FOR KAMCHATKA E
NOISE LEVEL (USING

D. AN ALTERNATE ADAPTIVE ALGORITHM

Part two of Subsection III-D showed that the adaptive algorithm

\[ A^{new} = A^{old} + 2\mu \, X^T A^{old} \, (\bar{X} - X) \]

minimizes the mean square adaptive filter output $A^T E[X X^T] A$ subject to
the constraint conditions $\sum_{i=1}^{M} a_i(j) = g_j$ $(j = -N, \ldots, -1, 0, 1, \ldots, N)$ when
the constant-valued convergence factor $\mu$ is chosen to be small enough. If
such an algorithm had been used, bursts of power from off-azimuth seismic
events would have been the dominant factor in the choice of the filter vectors
A. Since seismic events cannot, in general, be expected to repeat themselves,
adaptive filters designed on this basis would devote significant effort to the
elimination of energy which had long since disappeared. The adaptive
algorithm


\[ A^{new} = A^{old} + \frac{2 K_s \, X^T A^{old} \, (\bar{X} - X)}{(X - \bar{X})^T (X - \bar{X})} \]

used for this study attempts to minimize the quantity

\[ A^T \, E\!\left[ \frac{X X^T}{(X - \bar{X})^T (X - \bar{X})} \right] A \]

subject to the same maximum-likelihood constraints $\sum_{i=1}^{M} a_i(j) = g_j$,
where E denotes the expectation of each matrix element inside the brackets.
To prevent the squared magnitude

\[ |A^{new} - A^{old}|^2 = \frac{4 K_s^2 \, y^2(t)}{(X - \bar{X})^T (X - \bar{X})} \]

of the filter update vector for this algorithm from becoming enormous when
a strong signal traverses the array, the adaptive filter set must be frozen
whenever a signal is detected. With this algorithm, each update reduces the
power output for the old data vector by a factor $(1 - 2K_s)^2$ regardless of the
fact that the data vector X may be consistent with a signal:

\[ (A^T)^{new} X X^T A^{new} = (A^T)^{old} X X^T A^{old} \, (1 - 2K_s)^2 . \]


The  alternate  algorithm 


,,  2K  XTA°ld(X-X) 
new  old  s  '  ' 

A  =  A  +  - — - — - 

T 

X  X 


attempts  to  minimize  the  quantity 


T 

A  E 


T 

X  X 


subject  to  the  same  maximum -likelihood  constraints.  With  this  algorithm, 
the  squared  magnitude 


.new  old 
A  -  A 


4K^  y2(t)  (X-X)T(X-X) 

T  2 
(X  X) 


of the filter update vector decreases when a strong signal propagates across
the array. If the updated filter vector were applied to the old data vector,
moreover, the adaptive filter output would be

$$X^T A^{\mathrm{new}} = X^T A^{\mathrm{old}}\left[1 - 2K_s\,\frac{(\bar{X} - X)^T(\bar{X} - X)}{X^T X}\right],$$

so that the squared filter output would be reduced by a factor of
$\left[1 - 2K_s\,(\bar{X} - X)^T(\bar{X} - X)/X^T X\right]^2$ instead of $(1 - 2K_s)^2$. In the case of an ideal

signal, there would be no attenuation at all. With weak signals having
signal-to-noise ratios too low to trigger a filter freeze, this algorithm
appears to have a definite advantage over the algorithm actually employed: the
convergence factor for the alternate algorithm would probably be significantly
greater than for the algorithm actually used if signal degradation were the
same for both algorithms, so that greater noise reduction and hence greater
signal-to-noise gain might be achieved by the alternate algorithm. The
suggested algorithm, like the one actually implemented, would also be less
sensitive to power bursts from seismic events, so that non-repeating seismic
events would be "forgotten" more rapidly than with the algorithm


$$A^{\mathrm{new}} = A^{\mathrm{old}} + 2\mu\,X^T A^{\mathrm{old}}\,(\bar{X} - X) .$$


Normalization of the data vector X by its absolute value $|X|$ in the
proposed algorithm instead of the absolute value $|\bar{X} - X|$, furthermore, is
sensibly motivated: the adaptive filter responds to the phase characteristics
of the data vector X and not to its magnitude. It is difficult to describe
precisely the effect of the normalizing factor $|\bar{X} - X|$.
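The behavior claimed for the alternate algorithm can be illustrated with a short numerical sketch. It rests on the same assumption as before, namely that $\bar{X}$ is the data vector with every channel replaced by the channel average at each lag. The function names and the synthetic "noise" and "ideal signal" vectors are hypothetical; an ideal signal is modelled here as an identical waveform on every time-aligned channel.

```python
import numpy as np

def beamsteer_vector(X):
    """Assumed form of X-bar: every channel replaced by the channel average at each lag."""
    return np.tile(X.mean(axis=0), (X.shape[0], 1))

def update_alternate(A_old, X, Ks):
    """A_new = A_old + 2 Ks y (Xbar - X) / (X^T X), with y = X^T A_old."""
    d = beamsteer_vector(X) - X
    y = np.sum(X * A_old)
    return A_old + 2.0 * Ks * y * d / np.sum(X * X)

def attenuation_factor(X, Ks):
    """Factor applied to the filter output when the updated filter is applied to X."""
    d = beamsteer_vector(X) - X
    return 1.0 - 2.0 * Ks * np.sum(d * d) / np.sum(X * X)

rng = np.random.default_rng(1)           # synthetic data, not ALPA recordings
channels, lags, Ks = 6, 5, 0.005
A_old = rng.standard_normal((channels, lags))
noise = rng.standard_normal((channels, lags))               # differs from channel to channel
ideal = np.tile(rng.standard_normal(lags), (channels, 1))   # same waveform on every channel

A_noise = update_alternate(A_old, noise, Ks)
A_ideal = update_alternate(A_old, ideal, Ks)

print(attenuation_factor(noise, Ks))     # below 1: incoherent energy is reduced
print(attenuation_factor(ideal, Ks))     # essentially 1: an ideal signal is not attenuated
print(np.allclose(A_ideal, A_old))       # the filter barely moves for an ideal signal
```

For the same ideal-signal vector, the implemented normalization would divide by a quantity at or near zero, which is the situation the filter freeze is meant to avoid.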




SECTION  VI 

VARIABILITY  OF  NOISE  REDUCTION 


A.  INTRODUCTION 

In Subsection B, the results of adaptive processing on two data samples
are studied to determine the variation in noise reduction at different steer
directions. In processing these samples, the adaptive filter is frozen when the
similarity detection algorithm spots bursts of energy from the look direction.
The reason for allowing the filter freeze is to simulate more accurately actual
conditions of operation.

In Subsection C, the noise reduction figures from all noise samples
processed for this report are compared in order to assess the range of
signal-to-noise ratio improvement which may be expected from adaptive
processing. An evaluation is made of the reasons for the performance achieved
at each steer direction of each data sample. The signal-to-noise gain estimates
for noise samples discussed in this subsection are only indirectly related to
the superdirectivity effects on coherent events demonstrated in Section V.

B. VARIATION IN NOISE REDUCTION AT DIFFERENT STEER DIRECTIONS

The first data sample covers the period 0115 to 0510 on day 321 of 1971.
The noise level of this sample at periods between 15 and 20 seconds is close to
the peak level for the 1971 autumn season. Sites 1, 2, 3, 6, 9, and 19 are used
as input channels for the adaptive-filter beams, and the convergence rate is set
to 0.5% of maximum. The adaptive-filter beams were aimed toward energy
arriving at 3.5 km/sec from azimuths of 0°, 90°, 180°, and 270°, respectively.
Since the adaptive filter set was frozen no fewer than seven times while the
steer direction was 90°, much of the energy in this four-hour sample must
have come from azimuths close to 90°. Figure VI-1 presents two of the
microseismic bursts which triggered cessation of the filter update. The
period of oscillation is 18 seconds throughout both microseisms. If a detection
algorithm is to distinguish between such microseismic activity and earthquakes,
the dispersion characteristics of teleseismic events will have to be used.
Broadband noise reduction for this noise sample is 1.2 dB at 0°, 1.4 dB at 90°,
3.1 dB at 180°, and 1.5 dB at 270°. Corresponding figures for noise reduction
in the 15-to-43-second band are 1.7 dB, 1.6 dB, 4.1 dB, and 2.6 dB,
respectively. Maximum noise reduction is 3.6 dB at a 14-second period for the 0°
steer direction, 2.3 dB at a 20-second period for the 90° steer direction, 6.3
dB at an 18-second period for the 180° steer direction, and 4.9 dB at an
18-second period for the 270° steer direction. Figures VI-2 through VI-9 show
beam spectra and noise reduction as a function of frequency, respectively, for
the four look directions. Spectral levels are given in dB relative to one
[illegible]. The principal microseismic peak near 18 seconds is strongest on
the 90° beam, but only slightly stronger than on the 180° beam. Beamsteer
output levels on the 0° and 270° beams are a few dB lower at 18 seconds. A
visual examination of the beam outputs reveals strong microseismic bursts of
18-second period on both the 90° beam and 180° beam. A 360° Fisher-detector
scan of this data sample confirms that most of the energy is concentrated
between 108° and 132° azimuths. The strong noise reduction relative to
beamsteering on the 180° beam is due to the superdirectivity of the
adaptive-filter beam. It is somewhat puzzling that noise reduction is noticeably
better on the 270° beam than on the 0° beam. The WSW-ENE orientation of the six
sites used as input channels may explain the difference: the main lobe of the
array wavenumber response would be elongated along a NNW-SSE axis so that the
0° adaptive-filter beam would have more difficulty than the 270° adaptive-filter
beam in rejecting noise from a 120° azimuth; the channel amplitude weightings
would thus tend to be larger on the 0° beam than on the 270° beam, and
consequently poorer performance could be expected for noise of 8-second
period, which tends to be relatively incoherent.

FIGURE VI-1
TWO MICROSEISMIC BURSTS WHICH TRIGGERED FILTER FREEZE
(DAY 321 1971, STEER DIRECTION 90°, K_s = 0.005)
Traces: Beamsteer Output; Adaptive Filter Output

FIGURE VI-2
BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 321 1971, STEER DIRECTION 0°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-4
BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 321 1971, STEER DIRECTION 90°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-5
ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY
(DAY 321 1971, STEER DIRECTION 90°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-7
ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY
(DAY 321 1971, STEER DIRECTION 180°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-8
BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 321 1971, STEER DIRECTION 270°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-9
ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY
(DAY 321 1971, STEER DIRECTION 270°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)
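The broadband and band-limited noise reduction figures quoted for this sample are ratios of beamsteer output power to adaptive-filter output power expressed in dB. The sketch below shows one way such a figure could be computed from two output traces. It is an illustration only: the raw periodogram, the one-second sample interval `dt`, and the function name are assumptions, and the report's own spectral estimator is not described in this section. The band limits 0.0234 to 0.0664 Hz are those given in the conclusions for periods between 43 and 15 seconds.

```python
import numpy as np

def noise_reduction_db(beamsteer, adaptive, dt=1.0, band=None):
    """Noise reduction of `adaptive` relative to `beamsteer` in dB.

    dt   : sample interval in seconds (assumed value, not taken from the report)
    band : optional (f_low, f_high) in Hz; None means all non-zero frequencies
    """
    freqs = np.fft.rfftfreq(len(beamsteer), d=dt)
    p_bs = np.abs(np.fft.rfft(beamsteer)) ** 2      # raw periodogram, beamsteer output
    p_af = np.abs(np.fft.rfft(adaptive)) ** 2       # raw periodogram, adaptive output
    if band is None:
        keep = freqs > 0.0                          # leave out the DC term
    else:
        keep = (freqs >= band[0]) & (freqs <= band[1])
    return 10.0 * np.log10(p_bs[keep].sum() / p_af[keep].sum())

rng = np.random.default_rng(2)                      # synthetic traces, not ALPA data
beamsteer = rng.standard_normal(4096)
adaptive = 0.5 * beamsteer                          # amplitude halved at every frequency

broadband = noise_reduction_db(beamsteer, adaptive)
narrowband = noise_reduction_db(beamsteer, adaptive, band=(0.0234, 0.0664))
print(round(broadband, 2), round(narrowband, 2))    # both about 6.02 dB for this toy case
```

With real traces the two numbers differ, as they do in the text, because the adaptive filter removes more power in the microseismic band than elsewhere.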

The second data sample spans the interval 0355 to 0725 on day 203 of
1971. This sample is the same sample described in Section IV (used there to
study noise reduction as a function of convergence rate). Sites 8, 13, 14, 15,
16, and 17 of the ALPA array are the input channels for the beamsteer and
adaptive-filter beams. The convergence rate is chosen to be 0.5% of maximum.
Again, four beams are steered toward azimuths of 0°, 90°, 180°, and 270°.
This noise sample contains at least four seismic events. Three of these are
reported by the PDE bulletin and are given in Table VI-1. A fourth event
which triggered a filter freeze on the 270° beam is displayed in Figure VI-10.
This fourth event corresponds to a peak at 0426-0427 in the 275° look direction
for a Fisher-detector scan from 180° to 300° at 5° azimuthal increments using
18 sites. The first New Ireland event arrived slightly less than an hour later
and is pictured in Figures VI-11 and VI-12 as it appeared on the 270° and 180°
beams. A glitch near the beginning of this event (seen most clearly in Figure
VI-11) was caused by a burst of energy on a single channel. Although the event
is located at an azimuth of 239° with respect to ALPA, the 180° beam contains
the strongest signal. This fact suggests that the apparent direction of arrival
is closer to 180° than 270°. A 360° Fisher-detector scan at 24° increments
using sites 8, 13, 14, 15, 16, and 17 reaches its maximum at 228° before the
glitch in Figures VI-11 and VI-12 and at 216° after the glitch. If such a shift
in the apparent direction of arrival away from the direction of the source is a
normal occurrence, the signal-to-noise ratio obtained by steering toward the
source is lower than that which could be achieved by steering toward the
apparent direction of arrival at ALPA. If the apparent direction of arrival
were determined before steering the array, some processing gain might be
realized. A Fisher-detector scan at 5° azimuthal increments from 180° to
300° using 18 sites, on the other hand, no longer exhibits the southward bias
in the apparent arrival direction. The directional peak varies from 215° to
255° during the time interval 0517 to 0525. Thus, with the full array, the
prospect of realizing signal-to-noise gain by steering toward the apparent
direction of arrival seems limited. Figure VI-13 is a plot of the quietest
noise section of the four-hour sample. A comparison with Figure VI-1 (from
autumn) illustrates the wide variation in noise levels and spectral content
throughout the year. A comparison of Figure VI-13 with the seismic events
during the period 0355 to 0725, on the other hand, indicates that a significant
portion of the total energy in the sample is from the seismic events. Since
PDE events occur slightly more frequently than once every two hours, this
situation may be typical of summertime. Figure VI-14 portrays the second
New Ireland event as seen by the 270° beam. On the 180° beam, the energy
level is about the same as on the 270° beam. The Fisher-detector sweep
using sites 8, 13, 14, 15, 16, and 17 peaks at a 228° azimuth, but at a 237.5°
azimuth when 18 sites are used. Figure VI-15 is the magnitude 5.4 event
from the Northern Celebes region as seen by the 270° beam. Once again, the
six-site Fisher-detector scan indicates an arrival azimuth of 252°, while the
corresponding 18-site scan produces azimuthal estimates ranging from 250° to
285° over the time interval 0655-0700.

FIGURE VI-10
WEAK EVENT WHICH TRIGGERED FILTER FREEZE
(DAY 203 1971, STEER DIRECTION 270°, K_s = 0.005)
Traces: Beamsteer Output; Adaptive Filter Output

FIGURE VI-12
NEW IRELAND EVENT WITH GLITCH (ORIGIN TIME 04:37:09.1)
(DAY 203 1971, STEER DIRECTION 180°, K_s = 0.0005)
Traces: Beamsteer Output; Adaptive Filter Output

Broadband noise reduction for this sample is 2.2 dB at 0°, 1.4 dB at
90°, 2.4 dB at 180°, and 2.4 dB at 270°. The corresponding noise reduction
in the 15-to-43-second band is 2.4 dB, 1.4 dB, 2.5 dB, and 2.6 dB,
respectively. Maximum noise reduction is 3.7 dB at a 23-second period for the 0°
beam, 3.3 dB at a 51-second period for the 90° beam, 5.5 dB at a 23-second
period for the 180° beam, and 4.2 dB at a 21-second period for the 270° beam.
Figures VI-16 through VI-21 graph beam spectra and noise reduction as a
function of frequency, respectively, for the 0°, 90°, and 180° beams. Since
noise reduction on the 270° beam seldom differs by more than 0.1 dB from
that shown in Figure IV-6, no new illustrations have been generated for that
beam. The beams aimed toward 0° and 90° possess the least power: spectral
peaks in the 20-to-36-second band are comparable to the height of the principal
microseismic peak at 18 seconds. The main peaks at 20 seconds on the 180°
beam and at 21 seconds on the 270° beam are about 5 dB higher. Note that the
180° beamsteer output contains less power at periods of 21 seconds and above
than the 270° beam. The relatively high-frequency power on the 180° beam
probably comes from the two New Ireland events. Since most of the energy in
this sample arrives at apparent azimuths ranging from 210° to 270°, the highest
noise reduction peaks for the 180° and 270° beams may be attributed to
superdirectivity effects on coherent seismic events. The greater noise reduction
on the 0° beam relative to the 90° beam is due to the fact that the 90°
beamsteer output contains less coherent energy to eliminate.

FIGURE VI-13
QUIETEST NOISE SECTION BETWEEN 0355 AND 0725
(DAY 203 1971, STEER DIRECTION 270°, K_s = 0.005)
Traces: Beamsteer Output; Adaptive Filter Output

FIGURE VI-14
NEW IRELAND EVENT (ORIGIN TIME 05:53:41.4)
(DAY 203 1971, STEER DIRECTION 270°, K_s = 0.005)
Traces: Beamsteer Output; Adaptive Filter Output

FIGURE VI-15
MAGNITUDE 5.4 NORTHERN CELEBES EVENT
(DAY 203 1971, STEER DIRECTION 270°, K_s = 0.005)

FIGURE VI-17
ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY
(DAY 203 1971, STEER DIRECTION 0°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-18
BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 203 1971, STEER DIRECTION 90°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-20
BEAMSTEER AND ADAPTIVE FILTER OUTPUT POWER DENSITY
(DAY 203 1971, STEER DIRECTION 180°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

FIGURE VI-21
ADAPTIVE FILTER NOISE REDUCTION AS A FUNCTION OF FREQUENCY
(DAY 203 1971, STEER DIRECTION 180°, K_s = 0.005)
Axes: Period (Seconds); Frequency (Hz)

C.  VARIABILITY  OF  SIGNAL-TO-NOISE  GAIN 

Table VI-2 furnishes the noise reduction values for each of the ten
four-hour samples processed for this report. Figure VI-22 provides a plot of
these noise reduction values together with the corresponding signal-to-noise
gains for the weak signal from 300°-305° (day 276), which is 6 dB above noise
level on the beamsteer output (see Subsection IV-D). Sample I suggests that
1.2 dB broadband and 1.3 dB narrowband signal-to-noise gain can be expected
from pure summer background noise. Sample J intimates that the corresponding
gains for winter background noise are 1.4 dB and 1.7 dB, respectively.
Cases A through D near the peak fall noise level of 1971 on day 321 illustrate
the variation in noise reduction as steer direction changes. Superdirectivity
effects account for the sustained 2.7 dB and 3.7 dB gains on the 180° beam.
Since the filter set was frozen seven times on the 90° beam, much of the energy
is very close to the look direction and adaptive processing produces
signal-to-noise gains of only 1.0 dB broadband and 1.2 dB over the
15-to-43-second band. On the 270° beam, the gains achieved are 1.1 dB and
2.2 dB. In this case, adaptive filtering was able to shape the array beam
pattern so as to reduce significantly the microseismic energy from 108° to
132°. The 0° adaptive-filter beam was not nearly so successful and yielded
gains of 0.8 dB and 1.3 dB. A possible explanation is the orientation of the
main lobe of the array wavenumber response along a NNW-SSE axis.


Cases E through H demonstrate the signal-to-noise gain achievable at
the steer directions 0°, 90°, 180°, and 270° when several events propagate
across ALPA at apparent azimuths ranging from 210° to 270°. Superdirectivity
accounts for the 2.0 dB and 2.1 dB gains on the 180° beam as well as
the 2.0 dB and 2.2 dB gains on the 270° beam. Some of the 1.8 dB and 2.0 dB
gain on the 0° beam can be explained in the same way. In contrast,
signal-to-noise gains of only 1.0 dB (both broadband and narrowband) were
obtained on the 90° beam.


These results imply that signal-to-noise ratio improvement of one to
two dB can be preserved over four-hour noise samples. In one case involving
superdirectivity, broadband gain was 2.7 dB, narrowband gain 3.7 dB. Of
course, as was amply demonstrated in Section V, even greater signal-to-noise
improvement can be achieved on off-azimuth seismic events of brief duration.
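The gains quoted in this subsection can be related to the noise reduction figures of Subsection B by simple arithmetic, since the report measures signal-to-noise gain as noise reduction minus signal degradation. The sketch below assumes that the 0.41 dB degradation quoted in the conclusions for the 6-dB signal at a 0.5% convergence rate applies to every steer direction and to both bands; the source does not state this explicitly, and the dictionary layout and function name are hypothetical.

```python
# Signal degradation quoted in the conclusions for the 6-dB signal at Ks = 0.005.
# Assumption: the same value is applied to every case and to both bands.
SIGNAL_DEGRADATION_DB = 0.41

# (broadband, 15-to-43-second band) noise reduction in dB, as quoted in Subsection B,
# keyed by (data sample, steer direction in degrees).
NOISE_REDUCTION_DB = {
    ("day 321", 0): (1.2, 1.7),
    ("day 321", 90): (1.4, 1.6),
    ("day 321", 180): (3.1, 4.1),
    ("day 321", 270): (1.5, 2.6),
    ("day 203", 0): (2.2, 2.4),
    ("day 203", 90): (1.4, 1.4),
    ("day 203", 180): (2.4, 2.5),
    ("day 203", 270): (2.4, 2.6),
}

def snr_gain_db(noise_reduction_db, degradation_db=SIGNAL_DEGRADATION_DB):
    """Signal-to-noise gain = noise reduction minus signal degradation (all in dB)."""
    return noise_reduction_db - degradation_db

gains = {
    case: tuple(round(snr_gain_db(value), 1) for value in pair)
    for case, pair in NOISE_REDUCTION_DB.items()
}
for case, pair in gains.items():
    print(case, pair)
```

Under that assumption the rounded results match the gains quoted above, for example 2.7 dB and 3.7 dB on the 180° beam for day 321 and 1.0 dB in both bands on the 90° beam for day 203.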




SECTION  VII 




CONCLUSIONS 


Floating  DC  levels  in  the  data  channels  transmitted  from  ALPA 
caused  considerable  difficulty  in  implementing  an  adaptive  filtering  system 
until  their  effect  was  studied  theoretically  and  effective  remedial  action  taken. 
Two  steps  were  necessary: 

•  The  data  traces  were  run  through  a  filter  having  a  response 
exactly  equal  to  zero  at  DC. 

•  The  adaptive  filtering  program  was  examined  to  uncover  DC 
bias  introduced  by  the  computations.  Bias  compensation  was 
incorporated  into  the  program  and  intermediate  results  were 
rounded  instead  of  truncated  whenever  possible. 

Roundoff error in the adaptive-filter update equation

$$A^{\mathrm{new}} = A^{\mathrm{old}} + \frac{2K_s\,X^T A^{\mathrm{old}}\,(\bar{X} - X)}{(\bar{X} - X)^T(\bar{X} - X)}$$

almost became a serious problem in obtaining the results of this report. When
the data points were scaled by a factor of 16, however, error in the input
channels to the adaptive filter was reduced to the point where the data vector
X and the beamsteer output vector $\bar{X}$ were almost as accurate as the
corresponding vectors computed with floating-point arithmetic. In one noise
sample, this scaling procedure increased the noise reduction of adaptive
filtering relative to beamsteering from 2 dB to 6 dB at the convergence rate
$K_s = 0.30$. The 2-dB noise reduction figure for unscaled data is apparently
due to sign reversals in the adaptive filter output $y(t) = X^T A$ (so that the
adaptive filter vector A moves in the wrong direction) and to zero values of
y(t) after roundoff (so that the adaptive filter vector does not move at all).
With data scaled by 16, errors in the vector $(\bar{X} - X)$ and the adaptive
filter output y(t) were predominantly digitization errors. The sensor gain
doubling at ALPA in the summer of 1972 should permit these errors to be halved.
At the most frequently employed convergence rates (near $K_s = 0.005$), the
dominant source of error was the error in rounding the updated filter weights
(on the right side of the update equation) to the nearest filter-weight count.
The mean angle of error in the vector $(A^{\mathrm{new}} - A^{\mathrm{old}})$ was
estimated as 18° for one summer noise sample at the convergence rate
$K_s = 0.005$. This angle could have been reduced to 10° if errors in the
maximum-likelihood constraint conditions had been corrected differently. As the
convergence rate drops below 0.5%, the filter-weight roundoff error becomes
progressively worse until ultimately the filter vector A cannot change. The way
to improve this situation is to incorporate more bits into the filter-weight
representation. Such a solution would have meant abandoning the special
convolution-filter microcode instruction incorporated in the IBM 360/40
computers at SDAC, where a 16-bit filter-weight representation is required for
the CF1L microcode. Had filter-weight roundoff error been eliminated, the
effect would have been to reduce (probably only slightly) the convergence rate
at which the highest adaptive-filtering signal-to-noise gains relative to
beamsteering were achieved.
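The effect of rounding the filter update to whole filter-weight counts can be seen in a small numerical sketch. All values below are hypothetical and chosen only to make the arithmetic easy to follow: a three-element difference vector in counts, a filter output of 2000 counts, and two convergence rates. The sketch does not reproduce the report's fixed-point arithmetic or its 18° and 10° estimates.

```python
import numpy as np

def rounded_update(y, d, Ks):
    """Exact update 2 Ks y d / |d|^2 and the same update rounded to whole counts."""
    exact = 2.0 * Ks * y * d / np.dot(d, d)
    return exact, np.rint(exact)

def angle_deg(u, v):
    """Angle in degrees between two non-zero vectors."""
    c = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return np.degrees(np.arccos(np.clip(c, -1.0, 1.0)))

d = np.array([3.0, -1.0, -2.0])   # hypothetical (Xbar - X) in counts; sums to zero
y = 2000.0                        # hypothetical filter output in counts

exact_hi, rounded_hi = rounded_update(y, d, Ks=0.005)
exact_lo, rounded_lo = rounded_update(y, d, Ks=0.0005)

print(rounded_hi)                       # the filter still moves, in a slightly wrong direction
print(angle_deg(exact_hi, rounded_hi))  # angular error introduced by the rounding
print(rounded_lo)                       # every element rounds to zero: the filter cannot change
```

At the smaller rate every element of the update is less than half a count, so the rounded update vanishes, which is the limiting behavior described above for convergence rates well below 0.5%.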

In determining adaptive-filtering signal-to-noise gains, the critical
area of concern is the processing improvement for weak signals in the
borderline detection range, where the signal-to-noise ratio on the beamsteer
output is between 6 and 12 dB. With such weak signals, detection procedures
cannot consistently recognize the presence of a signal, and no filter-freeze
procedure can be implemented. One signal approximately 6 dB above
the noise level on the beamsteer output was used for the critical results.
Signal-to-noise gain was measured as the difference between adaptive-filtering
noise reduction and signal degradation. Optimum gain was realized
near a 0.5% convergence rate. Using noise data from day 238 of 1970,
signal-to-noise gain for the weak signal was 1.23 dB broadband, 1.34 dB in the
frequency band for periods between 43 and 15 seconds. With noise data from
day 203 of 1971, broadband gain was 2.03 dB, narrowband gain 2.19 dB.
Due to greater degradation of stronger signals, the signal-to-noise gain was
lower for signals 18 to 24 dB above the noise level on the beamsteer output
when the adaptive filter set was permitted to update.

With signals as strong as these, it is easy to detect their presence.
A scaled version of the Fisher detection algorithm was used for this purpose.
With the particular adaptive algorithm employed, the standard procedure is
to freeze the adaptive filter set. When the filter set was prevented from
updating upon signal detection, signal degradation was less than 0.1 dB for a
signal 18 dB above the noise level on the beamsteer output and almost exactly
0 dB for a signal with a beamsteer-output signal-to-noise ratio of 24 dB. These
figures are contrasted with a signal degradation of 0.41 dB for the 6-dB signal
at a convergence rate of 0.5%. Although the signal degradation is lower for the
two strong signals when the filter is frozen, noise reduction begins to drop as
the elapsed time from the point of the filter freeze increases. Loss in noise
reduction was measured by twice processing a noise sample from day 232 of
1970. In both cases, the filter was permitted to adapt for the first three hours
of the noise sample. In the first computer run, it was allowed to adapt for one
more hour. In the second run, it was frozen during the final hour. The apparent
trend of the loss in noise reduction indicates that higher signal-to-noise gain
is preserved for at least 15 minutes by freezing the filter rather than updating
it in the case of the 18-dB signal. Superior gain is maintained much longer
for the 24-dB signal.
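The freeze procedure amounts to skipping the filter update while a detection flag is set and continuing to apply the last filter. The sketch below illustrates that control flow under heavy simplification: a single-lag filter (one weight per time-aligned channel), a beamsteer starting filter, and a boolean `freeze` array supplied by the caller as a stand-in for the Fisher detector. None of these choices is taken from the report's program, and the names are hypothetical.

```python
import numpy as np

def run_beam(data, Ks, freeze):
    """Process time-aligned samples with the normalized update, honoring a freeze flag.

    data   : array of shape (time, channels), already steered to the look direction
    freeze : boolean array of length `time`; True means "do not update the filter"
    """
    n_t, m = data.shape
    A = np.full(m, 1.0 / m)              # start from the beamsteer filter
    out = np.empty(n_t)
    for t in range(n_t):
        X = data[t]
        y = X @ A                        # adaptive filter output for this sample
        out[t] = y
        d = X.mean() - X                 # (Xbar - X); sums to zero over channels
        power = d @ d
        if not freeze[t] and power > 0.0:
            A = A + 2.0 * Ks * y * d / power
    return out, A

rng = np.random.default_rng(3)           # synthetic data, not ALPA recordings
data = rng.standard_normal((200, 6))

never = np.zeros(200, dtype=bool)        # filter always allowed to adapt
always = np.ones(200, dtype=bool)        # filter frozen for the whole run

out_adapt, A_adapt = run_beam(data, Ks=0.005, freeze=never)
out_frozen, A_frozen = run_beam(data, Ks=0.005, freeze=always)

print(np.allclose(out_frozen, data.mean(axis=1)))  # frozen from the start: plain beamsteer
print(np.isclose(A_adapt.sum(), 1.0))              # unit look-direction response is kept
```

In an experiment like the one described above, the flag would be False for the first three hours and True for the final hour of the second run.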




The alternate adaptive algorithm

$$A^{\mathrm{new}} = A^{\mathrm{old}} + \frac{2K_s\,X^T A^{\mathrm{old}}\,(\bar{X} - X)}{X^T X}$$

adapts much less rapidly in the presence of a signal. It is very possible that
this algorithm could operate at convergence rates higher than the value
$K_s = 0.005$ without degrading signals any more than the implemented algorithm
does at its optimum rate of $K_s = 0.005$. If so, greater signal-to-noise gains
could be realized through increased noise reduction at higher convergence rates.

One data sample from 2000 to 2357 on day 276 of 1971 was especially
rich in signals (both on-azimuth and off-azimuth). The following conclusions can
be drawn from this four-hour data sample:

•  Off-azimuth events are strongly suppressed in the adaptive-filter
beam when the filter set is not frozen. Some off-azimuth
signals are virtually annihilated. The more powerful
the off-azimuth event, the more it is stifled.

•  Greater directional resolution at ALPA and other similar
long-period arrays is achievable through the beam-narrowing
capability of multichannel filtering. This fact is extremely
important if long-period arrays of this type are to be used for
signal detection and location or for separation of multiple events.
With time-varying adaptive filters, off-azimuth events can be
nulled out in proportion to their signal-to-noise ratio, with
possible complications if two events overlap in time. With
fixed non-varying multichannel filters, superdirectivity can
be preserved in all circumstances, but then the ability to
quell specific bursts of off-azimuth energy in an on-line
processing mode is impaired.




Four four-hour noise samples were processed for this report.
Adaptive-filter beams were steered toward four different look directions in
processing two of these noise samples. In eight out of ten cases, the broadband
signal-to-noise gain which would have been achieved for the weak 6-dB signal was
within the range 0.98 to 2.02 dB. In one case, it would have been 0.75 dB;
in another, it would have been 2.7 dB. Over the band 0.0234 to 0.0664 Hz
(corresponding to periods between 43 and 15 seconds), the signal-to-noise
gain for the weak signal would have been between 0.98 and 2.0 dB in six cases,
between 2.0 dB and 2.2 dB in three cases. The last case would have yielded
a narrowband gain of 3.74 dB. The narrowband values are meaningful if a
bandpass filter for periods between 40 and 15 seconds is applied to the data.




SECTION  VIII 


REFERENCES 


Blandford, R. R., 1970, "An Automatic Event Detector at TFO", Seismic
    Data Laboratory Report No. 263, Teledyne Geotech.

Blandford, R. R., 1972, "Qualitative Properties of the F-Detector", Seismic
    Data Laboratory Report No. 291, Teledyne Geotech.

Booker, A. H., 1965, "Analysis of Variance as a Method for Seismic Signal
    Detection", Seismic Data Laboratory Report No. 216, Teledyne UED.

Daniell, T. P., 1968, "Adaptive Estimation with Mutually Correlated Training
    Samples", Stanford Electronics Laboratories, Stanford, California,
    Document SEL-68-083 (Technical Report TR 6778-4).

Edwards, J. P. III, S. A. Benno, and G. Creasey, 1967, "Evaluation of the
    CPO Auxiliary Processor", CPO Special Report No. 5, Texas
    Instruments Incorporated.

Frost, O. L. III, 1972, "An Algorithm for Linearly Constrained Adaptive
    Array Processing", Proceedings of the IEEE, Volume 60, pp. 926-935.

Gnedenko, B. V., 1962, The Theory of Probability, Chelsea Publishing
    Company, New York, pp. 132-135.

Melton, B. S., and L. F. Bailey, 1957, "Multiple Signal Correlators",
    Geophysics, Volume 22, No. 3, pp. 565-588.

Shumway, R. H., and L. L. Husted, 1970, "Frequency-Dependent Estimation
    and Detection for Seismic Arrays", Seismic Data Laboratory Report
    No. 249, Teledyne Geotech.

Shumway, R. H., 1971, "On Detecting a Signal in N Stationarily Correlated
    Noise Series", Technometrics, Volume 13, pp. 499-520.

Shumway, R. H., 1972, "Some Applications of a Mixed Signal Processor",
    Seismic Data Laboratory Report No. 280, Teledyne Geotech.

Smart, E., and E. A. Flinn, 1971, "Fast Frequency-Wavenumber Analysis
    and Fisher Signal Detection in Real-Time Infrasonic Array Data
    Processing", Geophysical Journal of the Royal Astronomical Society,
    Volume 26, pp. 279-284.

Smart, E., 1972, "FKCOMB, A Fast General-Purpose Array Processor",
    Seismic Array Analysis Center Report No. 9, Teledyne Geotech.

Widrow, B., 1966, "Adaptive Filters I: Fundamentals", Stanford
    Electronics Laboratories, Stanford, California, Report SEL-66-126
    (Technical Report 6764-6).

Wirth, M. H., R. R. Blandford, and R. H. Shumway, 1971, "Automatic
    Network Detection", Seismic Data Laboratory Report No. 285,
    Teledyne Geotech.

Wirth, M. H., 1971, "Computation of Multiple-Event Probabilities", Seismic
    Data Laboratory Report No. 277, Teledyne Geotech.


UNCLASSIFIED _ 


SECURITY  CLASSIFICATION  of  THIS  PAGE  (When  Data  Entarad) 


REPORT  DOCUMENTATION  PAGE 


1  REPORT  NUMBER 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


2  GOVT  ACCESSION  NO.  3.  R  ECIPIE  NT'S  C  AT  ALOG  NUMBE  R 


4  TITLE  (anc  Subtitle) 

SIMULATED  ON-LINE  ADAPTIVE  PRO¬ 
CESSING  RESULTS  USING  ALASKA  LONG 
PERIOD  ARRAY  DATA 


5. TYPE OF REPORT & PERIOD COVERED

Special


6. PERFORMING ORG. REPORT NUMBER


7. AUTHOR(s)

Thomas E. Barnard


8. CONTRACT OR GRANT NUMBER(s)

F33657-72-C-0725


9. PERFORMING ORGANIZATION NAME AND ADDRESS

Texas Instruments Incorporated
Equipment Group
Dallas, Texas 75222


11. CONTROLLING OFFICE NAME AND ADDRESS

Advanced Research Projects Agency
Nuclear Monitoring Research Office
Arlington, Virginia 22209


14. MONITORING AGENCY NAME & ADDRESS (if different from Controlling Office)

Air Force Technical Applications Center
VELA Seismological Center
Alexandria, Virginia 22314

10. PROGRAM ELEMENT, PROJECT, TASK AREA & WORK UNIT NUMBERS

VELA T/2705/B/ASD


12. REPORT DATE

23 October 1973


13. NUMBER OF PAGES

299


15. SECURITY CLASS. (of this report)

UNCLASSIFIED


15a. DECLASSIFICATION/DOWNGRADING SCHEDULE


16. DISTRIBUTION STATEMENT (of this Report)

APPROVED FOR PUBLIC RELEASE; DISTRIBUTION UNLIMITED.


17. DISTRIBUTION STATEMENT (of the abstract entered in Block 20, if different from Report)


18. SUPPLEMENTARY NOTES

ARPA Order No. 1714


19. KEY WORDS (Continue on reverse side if necessary and identify by block number)

Adaptive multichannel filtering
Seismic signal processing
Roundoff error
Alaska Long-Period Array

Reproduced by National Technical Information Service, U.S. Department
of Commerce, Springfield VA 22151


20. ABSTRACT (Continue on reverse side if necessary and identify by block number)

This report deals with results obtained from operating an adaptive
time-domain maximum-likelihood filtering system on data from the Alaska
Long-Period Array (ALPA). Signal-to-noise gain of adaptive filtering
relative to beamsteering is investigated as a function of convergence rate
and steer direction. In addition, the effect upon signal-to-noise gain of
freezing the adaptive filter set is described. Both on-azimuth and
off-azimuth signals are examined to determine how much they are attenuated
in the adaptive filter beam in comparison with the beamsteer output.
Signal-to-noise gain values presented are measured using beam output
traces formed from actual seismic data. Theoretical studies of the effect
of floating DC levels and roundoff error are also contained in this report.

A  major  advantage  of  adaptive  multichannel  filtering  is  found  to 
be  the  capability  to  narrow  the  main  lobe  of  the  array  beam  pattern. 




